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CHAPTER.  I 


INTRODUCTION 

Background 

Reparable  generations  forecasts  are  used  in  managing 
the  USAF  recoverable  item  inventory.  The  reparable  genera- 
tions forecasts  are  computed  by  the  Recoverable  Consumption 
Item  Requirements  System  (D041)  and  are  interfaced  with 
various  Management  of  Items  Subject  to  Repair  (MISTR)  sys- 
tems (9:1-1).  There  are  three  data  systems  which  form  the 
nucleus  of  the  mechanized  portion  of  MISTR:  (1)  Repair 

Requirements  Computation  System  (D073), (2)  Requirements 
Scheduling  and  Analysis  System  (G019C) , and  (3)  Contracts 
Schedules  and  Analysis  System  (G019F)  (8:2-1).  These  sys- 

tems interface  in  such  a manner  that  forecast  errors  can 
cause  substantial  difficulty  in  planning  depot  level  main- 
tenance (2;  3;  4). 

The  D041  System 

The  D041  system  is  used  to  identify  from  125,000 
line  items,  those  requiring  buy,  repair,  termination,  and/ 
or  disposal  actions  (9:1-1).  Repair  requirements  are  com- 
puted and  projected  by  stock  numbers  within  family  groups 
and  are  fed  into  the  D073  repair  requirements  system.  The 
D041  computations  are  made  quarterly.  The  first  quarterly 
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computation  (FY-1)  identifies  items  requiring  buy,  repair, 
termination,  and  disposal  action  (9:1-1).  The  second 
quarterly  computation  (FY-2)  is  used  for  adjusting  pre- 
viously initiated  item  actions,  updating  workload  projec- 
tions for  annual  negotiation,  buy  actions  in  process, 
termination  and  disposal  (9:1-1).  The  third  quarterly  com- 
putation (FY-3)  is  used  to  develop  budget  requests  and 
update  logistic  actions  in  process  (9:1-1).  The  fourth 
quarterly  computation  (FY-4)  is  used  to  update  logistics 
actions  in  process,  initiate  new  actions,  and  provide  data 
input  into  other  systems  (9:1-1).  The  D041  system  applies 
to  all  Air  Logistic  Centers  (ALCs) . It  is  used  to  manage 
expendable  investment  recoverable  spares;  to  develop  and 
adjust  requirements  data,  including  maintenance  factors; 
and  to  compute  recoverable  consumption  item  requirements 
(9:1-1) . 

The  MISTR  Systems 

The  DO 73  system  provides:  "a  quarterly  forecast 
for  planning  and  negotiation  purposes  and  a biweekly  short- 
range  computation  which  acts  as  a workloading  system  based 
on  the  quantity  negotiated  [8:2-1]."  The  G019C  system 
"interfaces  with  the  D073  system  and  is  the  vehicle  by 
which  items  are  scheduled  into  and  production  is  reported 
from  organic  SRAs  (Specialized  Repair  Activities)  [7:2-1]." 
The  G019F  system  "performs  a similar  function  for  contract 
sources  [8:2-1]." 
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The  objectives  of  MISTR  are  to: 

(1)  Project  quarterly  repair  quantities  on  a long- 
range  basis  (current  and  next  fiscal  year) . 

(2)  Negotiate  with  organic,  contractual,  or  inter- 
service agencies  for  all  funded  requirements. 

(3)  Workload  maintenance  facilities  based  on  nego- 
tiated quantities. 

(4)  Provide  necessary  tools  to  facilitate  optimum 
internal  depot  repair  scheduling  through  portrayal 
of  asset  and  component  availability. 

(5)  Provide  automatic  requisitioning  of  current  and 
longer  range  component  part  requirements. 

(6)  Provide  operating  and  management  personnel  the 
necessary  tools  to  monitor  scheduling  and  repair 
progress  through  preparation  and  analysis  of  a 
complete  range  of  management  products. 

(7)  Provide  optimum  communication  channels  and 
operating  frequency  between  the  customer  and  the 
repair  activity  [8:2-1]. 


The  Systems  Interface 

According  to  AFLCR  65-12  (Reference  Figure  1.1)  the 
requirements  computed  by  D041  for  inclusion  in  MISTR  are 
projected  by  actual  stock  number  and  are  mechanically  fed 
into  the  prime  Item  Manager's  (IM)  D073  requirement  system. 
The  long-range  repair  requirements  include  the  annual 
requirements  computed  by  the  D041  requirements  system  and 
are  provided,  by  quarter,  to  the  D073  MISTR  system.  Prior 
to  negotiating  the  long-range  requirement,  the  requirement 
is  reviewed  for  accuracy  and  completeness  and  to  insure  all 
technical  data  are  available  and  current.  The  quarterly 
repair  requirement  is  then  negotiated  between  the  IM/ALC, 
and  a preassigned  source  of  repair  (SOR) . These  are  yearly 
requirements  at  the  beginning  of  the  fiscal  year,  but  are 
negotiated  quarterly.  The  act  of  negotiation  aligns  the 
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requirements,  as  computed  and  reflected  by  the  D041  and 
D073  systems,  with  the  available  funds  and  manhours.  Vari- 
ous systems  and  methods  are  used  to  predict  the  avail- 
ability of  assets,  parts,  skills,  support  equipment,  and 
other  resources.  The  X-21  document,  which  is  provided  to 
the  IM,  reflects  eight  quarters  of  repair  requirements  as 
they  have  been  computed  by  the  D041  system  (5). 

Statement  of  the  Problem 

The  D04 1 and  D073  systems  interface  in  such  a manner 
that  reparable  generation  forecast  errors  can  cause  sub- 
stantial difficulty  in  planning  depot  level  maintenance  and 
the  current  forecasting  methods  may  not  present  the  most 
accurate  forecasts  (2;  3;  4).  Such  a possibility  suggests 
that  the  differences  between  actual  reparable  generations 
and  those  forecasted  through  use  of  the  D041  method  need  to 
be  analyzed  and  compared  with  the  differences  that  would 
exist  if  other  forecasting  methods  were  used. 

Research  Justification 

The  prime  responsibility  of  the  Air  Force  Logistics 
Command  is  to  insure  that  combat  units  of  the  Air  Force 
have  the  right  equipment  at  the  right  time  and  place  (8:22). 
To  accomplish  this  mission,  AFLC  "...  must  maintain  the 
ability  to  replenish  base  stock  levels  through  a constant 
flow  of  recoverable  material  to  and  from  our  global  deployed 
Air  Force  units  [8:2-2]."  It  is  in  the  best  interest  of 
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the  USAF  to  manage  the  recoverable  item  inventory  as  effi- 
ciently as  possible.  The  MISTR  system  is  designed  to 
manage  items  subject  to  repair,  and  its  success  depends 
upon  the  accuracy  of  preproduction  planning  (8:2-2). 

The  D041  and  D073  systems,  which  form  the  heart  of 
the  preproduction  planning  process,  are  used  in  the  manage- 
ment of  recoverable  item  inventories  (1) . The  question 
addressed  in  this  thesis  is  "What  is  the  significance  to 
production  planning  of  the  difference  between  the  forecasted 
and  the  actual  reparables  generated  (3)?"  An  understand- 
ing of  the  D041  computational  model  will  help  in  analyzing 
the  significance  of  that  difference.  After  describing  the 
model,  it  will  be  possible  to  make  statistical  comparisons 
with  the  forecast  errors  generated  by  other  predictive 
methods.  This  knowledge  will  enhance  the  understanding  of 
the  D041  inventory  management  system  and  ma^  indicate  areas 
in  which  improvements  can  be  made. 

Areas  of  Investigation 

There  are  three  areas  of  investigation  that  relate 
to  the  impact  of  D041  forecasting  errors  upon  the  depot 
maintenance  function: 

1.  The  method  used  to  forecast  reparable  genera- 
tions by  the  D041  system.  How  do  these  forecasts  compare 
with  actual  reparable  generations?  What  is  the  signifi- 
cance of  the  differences  found  in  the  comparison?  Would 
other  predictive  methods  be  more  accurate? 


imz — ^ 
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2.  The  negotiation  process.  What  requirements  are 
sacrificed  due  to  the  forecast  limitations  of  manhours  and 
facilities? 


3.  The  production  agency.  How  are  available  man- 
hours, facility  time,  and  spare  parts  availability  in  the 
maintenance  and  supply  systems  computed?  How  do  these 
forecasts  compare  with  actual  availability?  What  is  the 
significance  of  the  differences  found  in  the  comparison? 


The  volume  of  data  available  in  these  areas  is  enormous 
and  comparisons  of  data  in  the  past  have  not  been  made  with 
meaningful  analysis  (2;  3;  4;  5).  The  focus  of  thxs 
research  was  upon  the  first  of  these  three  areas.  Although 
the  research  did  not  focus  upon  the  negotiation  and  the  pro- 
duction processes,  it  is  necessary  to  understand  these 
processes  since  they  are  an  integrated  portion  of  the  inven- 


tory management  system (3). 


The  Negotiation  Process 

The  negotiation  process  aligns  the  requirements, 
as  computed  and  reflected  by  the  D041  and  D073  systems, 
with  the  available  funds  and  manhours  (8:2-3).  The  negotia- 
tions are  conducted  between  the  IM/ALC,  and  a preassigned 
organic  SOR.  Negotiated  quantities  are  to  be  a statement 
of  the  IM's  most  essential  requirements  limited  by  the  most 
recent  dollar/manhour  constraints  (5).  A screening  process 
is  accomplished  by  both  sides  prior  to  negotiation  to 


determine  the  best  mix  of  requirements  and  capabilities 
(5) . Forecast  errors  by  the  D041  system  can  impact  upon 
the  accuracy  of  the  negotiation  effort  (see  Figure  1.2). 

(3;  5).  Since  the  negotiation  process  is  based  upon  pro- 
jections, it  is  sometimes  necessary  to  renegotiate  in  order 
to  maintain  supportable  items.  Factors  such  as  asset 
availability,  parts  availability,  demand  rates,  modifica- 
tions, or  other  production- limiting  factors  may  necessitate 
a renegotiation  (5). 

The  Production  Process 

Inaccurate  forecasts  will  lead  to  inaccurate  nego- 
tiations (3;  5) . Inaccurate  negotiations  will  have  a nega- 
tive impact  upon  the  production  function  (5)  (see  Figure  1 . 3).. 
For  example,  a negotiation  which  establishes  the  number  of 
units  to  be  repaired  at  a level  higher  than  necessary 
causes  the  forecasts  of  parts  required  to  accomplish  the 
repair  to  be  too  high.  The  Directorate  of  Distribution 
(D/DS)  is  required  to  allocate  these  parts  for  the  negotia- 
ted number  of  units  to  be  repaired  (3;  5) . Since  the 
organic  SOR  is  incapable  of  meeting  the  negotiated  level, 
not  all  of  the  parts  will  be  used.  This  results  in  the  mis- 
allocation  of  operating  funds  by  D/DS  (5) . This  misalloca- 
tion  of  funds  reduces  the  amount  of  funds  available  for 
requisitioning  other  parts  necessary  to  accomplish  the 
repair  of  other  line  items  (3;  5).  This  in  turn  decreases 
the  production  of  other  line  items  requiring  repair. 
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Fig.  1.3.  The  Impact  of  Inaccurate  Nego 
tiations  of  the  Supply  System 
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If  the  number  of  units  negotiated  is  too  low,  the  number 
of  parts  forecasted  as  necessary  to  accomplish  the  repair 
will  be  too  low.  The  actual  production  of  a line  item 
would  be  decreased  because  of  a shortage  of  repair  parts 
(5)  . 

Forecast  Method 

The  preceding  section  discussed  the  negotiation  and 
production  processes,  the  impact  of  inaccurate  negotiations 
on  the  supply  system  and  production,  and  identified  fore- 
cast errors  as  one  of  the  major  factors  affecting  negotia- 
tions. This  section  is  concerned  with  the  forecast  method 
of  the  D041  system  and  its  related  forecasting  errors. 

The  forecasting  method  and  procedures  found  in  the 
current  directives  are  typically  translations  of  the  under- 
lying mathematical  model  of  the  technique  into  procedural 
instructions  (7).  The  instructions  are  provided  to  the  IM 
and  other  Materiel  Management  (MM)  agencies  for  use;  and, 
over  a period  of  time,  these  instructions  have  become  the 
focus  of  attention.  The  underlying  model  has  been  sup- 
pressed. As  long  as  the  technique  provides  reasonably 
accurate  forecasts,  no  particular  management  attention  is 
focused  on  it.  However,  when  the  forecast  errors  are  large, 
all  levels  of  management  must  answer  the  question,  "What 
significance  is  there  in  the  difference  between  the  fore- 
casted and  actual  reparables  generated  (3)?" 


Research  Question 


What  characteristics  are  exhibited  by  the  under- 
lying model  used  in  the  D041  inventory  management  system 
to  generate  reparable  generation  forecasts? 

Research  Hypotheses 

1.  The  expected  value  of  the  distribution  of  the 
forecast  errors  presently  found  in  the  D041  system  is 
not  zero,  i.e.,  that  the  forecast  is  biased. 

2.  Other  predictive  methods  for  inventory  manage- 
ment might  exhibit  less  bias  than  the  D041  forecasting 
technique. 


CHAPTER  II 


RESEARCH  METHODOLOGY 
Population  of  Interest 

This  research  effort  focused  upon  an  examination  of 
some  of  the  characteristics  of  the  D041  forecasting  method. 
The  D041  system  makes  quarterly  forecasts  for  125,000 
master  line  items  (2) . Recoverable  item  data  currently  in 
the  D041  system  could  not  be  made  available  in  the  required 
format  in  time  to  use  in  this  research.  Artificial  data 
patterns  were  generated  using  random  number  generators  of 
the  Time  Sharing  CREATE  Computer  System.  Five  patterns 
were  generated.  Each  pattern  consisted  of  120  data-points 
(the  equivalent  of  120  months  of  demand  history)  and  each 
pattern  was  begun  using  the  same  SEED  (12345)  so  that  the 
process  could  be  replicated  if  needed. 

Definition  of  Variables 

Within  the  complex  D041  forecasting  environment, 
many  variables  interact  to  influence  the  depot  reparable 
generations  forecast.  These  variables  include  the  subjec- 
tive variables  controlled  by  the  maintenance  manager  and 
random  variables  introduced  by  nature. 

Under  procedures  established  by  AFLCM  57-3, 
periodic  adjustments  of  these  subjective  variables  are  made 
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to  compensate  for  unpredicted  variations  and  errors  in  the 
D041  forecasting  system  (9:5-27,5-31).  Operationally, 
these  subjective  factors  influence  the  depot  workload  fore- 
cast, and  the  negotiation  process.  The  primary  variables 
used  in  computing  reparable  generations  forecasts  are: 

1.  Total  Organizational  Field  Maintenance  (OFM) 
Demand  Rate,  which  is  used  in  conjunction  with  the  Future 
Program  to  determine  the  projected  reparable  generations. 

2.  Base  Not  Reparable  This  Station  (NRTS)  rate, 
which  is  used  to  determine  which  portion  of  the  projected 
reparable  generations  is  expected  to  be  base  processed, 
and  which  portion  is  expected  to  be  repaired  at  the  depot. 

3.  Base  Condemnation  Rate,  which  is  used  to  deter- 
mine the  fraction  of  the  reparable  generations  processed 

at  base  to  be  condemned. 

4.  Depot  Condemnation  Rate,  which  is  used  to  deter 
mine  the  fraction  of  the  reparable  generations  processed 

at  the  Depot  to  be  condemned. 

5.  OFM  Depot  Demand  Rate,  which  is  used  on  conjunc 
tion  with  the  Future  Program  to  determine  the  projected 
depot  reparable  generations. 

6.  The  Program,  which  is  based  on  flying  hours, 
historical  data,  etc.,  and  is  used  in  conjunction  with  the 
OFM  demand  rates  to  determine  the  reparable  generations. 
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Tabulation  of  Variables 

Variables  of  significance  in  this  research  effort 


are  shown  in  Table  2.1. 


Data  Requirements 


This  research  effort  focused  upon  a time-series 
forecasting  methodology  and  required  36  forecasts  of  pro- 
jected depot  reparable  generations  and  the  corresponding 
36  data  points  of  actual  reparable  generations  for  each 
demand  pattern  analyzed.  Thirty-six  data  points  yielded 
sufficient  terms  so  that  the  Central  Limit  Theorem's  asser- 
tion of  normality  applied. 

Each  demand  pattern  consisted  of  120  data-points 
which  allowed  using  the  first  24  as  the  forecasting  base 
from  which  36  forecasts  were  generated  at  different  lead 
times  and  then  comparisons  were  made  with  the  actual 
data-points  and  the  forecast  errors  were  computed  and 
analyzed.  The  demand  patterns  created  were: 

Poisson.  A Poisson  process  was  generated  by  calling 
the  Poisson  function  from  the  School  of  Systems  and  Logis- 
tics Time  Sharing  Library  and  generating  120  data-points. 

The  mean  was  set  at  ten  and  the  series  was  begun  with  the 
SEED  (12345). 


Linearly  Increasing  Poisson.  The  Poisson  function 
was  called  and  the  mean  was  originally  set  at  ten  and  then 
the  mean  was  incremented  by  1/12  with  each  data-point 
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generation.  The  mean  of  the  120th  data-point  was  20.  The 
series  was  begun  with  the  SEED  (12345) . 

Linearly  Decreasing  Poisson.  The  Poisson  function 
was  called  and  the  mean  was  set  at  40  and  then  the  mean  was 
decremented  by  1/6  with  each  data-point  generation.  The 
mean  of  the  120th  data-point  was  20.  The  series  was  begun 
with  the  SEED  (12345). 

Alternating  Linear  Poisson.  The  Poisson  function 
was  called  and  the  mean  was  originally  set  at  20  and  then 
the  mean  was  incremented  by  1/3  for  12  data-point  genera- 
tions and  then  decremented  by  1/3  for  6 data-point  genera- 
tions and  this  process  continued  until  120  data-points  were 
generated. 

Sine.  The  sine  and  Poisson  functions  were  called 
and  the  mean  of  the  Poisson  was  set  at  10  and  then  allowed 
to  vary  according  to  a sine  function  of  amplitude  20.  In 
order  to  eliminate  negative  values,  this  oscillating  func- 
tion was  shifted  up  by  adding  a constant  of  value  50  to  it. 
The  series  was  begun  with  the  SEED  (12345)  and  120  data- 
points  were  generated. 


Research  Approach 

The  purpose  of  this  research  was  to  analyze  the 
impact  that  errors  in  the  depot  reparable  generations  fore- 
casts might  have  upon  the  scheduling  of  actual  production 
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through  the  depot  facility.  The  basic  approach  was  to: 

(1)  describe  the  underlying  forecasting  model  used  by  the 
D041  system  to  predict  depot  reparable  generations; 

(2)  compare  the  DO 41  forecasts  of  a limited  number  of  depot 
reparable  items  with  the  actual  number  for  the  same  periods; 
and  (3)  use  time  series  analysis  techniques  to  forecast 
from  the  same  data  base  and  to  compare  the  results. 

Testing  the  First  Research  Hypothesis 

The  first  research  hypothesis  is:  the  expected 
value  of  the  distribution  of  the  forecast  errors  presently 
found  in  the  D041  system  is  not  zero,  i.e.,  that  the  fore- 
cast is  biased.  For  example,  if  eT  represents  forecast 
errors  over  a forecast  horizon  of  length  x,  then  E(eT)  ^ 

0,  x = 1,2,3,...,  if  t\e  forecast  is  unbiased. 

The  statistical  hypothesis  can  be  formulated  as: 

Hq:  E(eT)  = 0 

1^:  E(eT)  * 0 

Since  each  forecast  is  an  estimator  of  the  actual  reparable 
generations,  the  symbol  used  in  this  research  for  the  fore- 

A 

cast  of  g(t)  made  x periods  earlier  is  gT(t).  The  forecast 
error  term  eT(t)  at  time  t is  defined  as  the  difference 
between  the  forecast  value  for  time  t made  at  time  t-x  and 
the  actual  value  observed  at  time  t.  It  is  computed  as: 
eT(t)  = gT (t)  - g (t)  for  some  fixed  lead  time  x.  The  eT(t), 
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t = 1,2,3,...  will  yield  a distribution  of  forecast 
errors  for  the  fixed  lead  time  t. 

To  facilitate  understanding,  imagine  that  time  has 
been  stopped  at  the  beginning  of  the  first  quarter  FY73. 
Using  the  single  moving  average  method  at  this  fixed 
point  in  time,  the  forecast  mechanism  can  compute  the 
forecast  for  every  time  point  in  the  future  (i.e.,  for 
any  lead  time  x) , using  the  previous  24  months  of  actual 
generations.  Since  this  fixed  point  in  time  has  passed, 
the  actual  number  of  generations  is  available  for  compari- 
son so  that  the  e (t)  can  be  computed  (see  Table  2.2). 

The  expected  value  of  the  forecast  error  for  lead 
time  x is  computed  as  follows : 

n 

^^C§T(t)-g(t)  ] 

E(eT)  = — • (2.1) 


This  is  the  expected  value  for  the  error  distribution 
associated  with  a fixed  lead  time,  x.  if  the  lead  time  is 
changed,  another  forecast  error  distribution  and  its  mean 
can  be  generated. 

Lead  times  of  1,  3,  6,  9,  12,  15,  18,  24,  27, 

30,  33,  and  36  months  were  used  in  this  study  and  the  fore- 
casting data  base  was  fixed  at  24  data  points.  Thirty- 
six  forecasts  were  computed  for  each  lead  time  and  these 
36  forecasts  yielded  36  forecast  errors  and  can  be  approxi- 
mated by  a normal  distribution  by  virtue  of  the  Central 
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Limit  Theorem.  The  research  hypothesis  was  designed  to 
determine  whether  the  forecasting  method  used  in  the  D041 
system  was  biased.  The  risk  of  making  a Type  I error  in 
the  statistical  test,  i.e.,  the  probability  of  rejecting 
a true  hypothesis,  should  be  minimized.  The  level  of 
significance  (a)  was  set  at  .01  with  the  resulting  two- 
tailed  test  having  a rejection  region  of  .005  in  each 
tail.  The  hypothesis  test  was  applied  to  each  of  the 
forecast  error  distributions.  If  the  statistical  test 
yielded  the  rejection  of  the  null  hypohtesis,  the  infer- 
ence to  be  made  was  that  the  D041  forecast  was  biased  for 
the  forecast  horizon  tested  because  the  probability  of 
observing  an  error  of  that  magnitude  would  be  .01  or 
less  if  the  hypothesis  was  true. 

A computer  program  was  written  to  compute  the 
forecast  errors  and  to  make  the  statistical  test  for  the 
D041  forecasting  method  (see  Appendix  A) . 

Testing  the  Second 
Research  Hypothesis 

The  second  research  hypothesis  states:  Other  fore- 

casting methods  may  exhibit  less  bias  than  the  D041  fore- 
casting method.  If  the  D041  forecasting  method  was  shown 
to  be  biased  for  items  whose  reparable  generations  followed 
a particular  pattern,  then  time  series  analysis  forecasting 
methods  were  used  to  determine  if  an  appropriate  model 
existed  which  would  exhibit  less  bias. 


to 


The  same  demand  patterns  which  resulted  in  biased 
forecasts  using  the  D041  forecasting  method  were  used 
to  make  time  series  analysis  forecasts.  The  same  data 
bases  and  lead  time  periods  were  used.  The  forecast 
errors  for  each  forecast  were  generated  and  e^  for  each 
lead  time  was  computed.  After  Efe^)  for  each  lead  time 
had  been  computed,  the  same  hypothesis  test  used  in  evalu- 
ating the  D041  forecasting  method  was  used,  i.e., 


Hq  : E(ex)  = 0 


Hl  : E(eT)  ? 0. 


The  alpha  level  was  again  set  at  .01  with  the  resulting 
two-tailed  test  having  a rejection  region  of  .005  in  each 
tail.  If  the  statistical  test  yielded  the  rejection  of 
the  null  hypothesis,  the  inference  to  be  made  was  that 
the  time  series  analysis  forecast  was  biased  because  the 
probability  of  observing  an  error  of  that  magnitude  would 
be  .01  or  less,  if  the  null  hypothesis  was  true.  If 
the  null  hypohtesis  cannot  be  rejected,  then  the  inference 
is  that  the  time  series  analysis  forecast  exhibits  less 
bias  than  the  D041  forecasting  method. 


Time  Series  Analysis 

A general  discussion  of  time  series  analysis  is 
presented  in  this  section,  with  a more  mathematical 
approach  presented  in  Appendix  B. 
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A time  series  is  a set  of  observations  taken  at 
equally  spaced  time  intervals,  i.e.,  Gfc,  t=l , 2 , 3 , . . . ,n. 

As  opposed  to  the  usual  assumption  of  stochastic  inde- 
pendence of  observations  found  in  other  forecasting  tech- 
. niques,  time  series  models  specifically  assume  that  the 
observations  are  correlated.  Box  and  Jenkins  have 
developed  an  iterative  method  for  modelling  this  depen- 
dence among  the  observations  in  a time  series  (1) . Rather 
than  being  a model  fitting  process,  it  is  a model  building 
process,  with  the  model  determined  upon  the  basis  of  data 
analysis  rather  than  upon  assumption. 

Determination  of  a model  is  accomplished  in  a 
stage  called  identification.  This  stage  is  followed  by 
parameter  estimation,  and  then  by  a series  of  diagnostic 
checks  to  determine  if  the  model  identified  provides  an 
adequate  description  of  the  stochastic  process  generating 
the  data.  If  in  the  checking  phase  the  model  is  shown  to 
be  deficient  in  some  way,  then  the  identification  phase 
is  reentered  and  the  entire  process  is  repeated.  When  a 
model  has  been  adequately  identified,  then  it  is  used  to 
forecast  future  realizations  of  this  stochastic  process. 

The  basic  tools  for  identification  of  the  time 
series  are  the  autocorrelation  and  partial  autocorrela- 
tion functions  of  the  observed  data  (1:147).  The  general 
time  series  model  can  be  described  in  the  following  manner. 
First,  a backshift  operator  is  defined  such  that 
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Bgt=gt_^.  The  symbol  at  is  used  to  denote  a random  error, 

entering  the  model  in  period  t.  These  errors  are  assumed 

to  be  independent,  normally  distributed  random  variables 

with  E(at)=0,  for  t=l , 2 , 3 , . . . ,n  and  a constant  variance 
2 

a . Using  these  definitions  and  disregarding  seasonal 

Cl 

and  trend  factors,  the  most  general  form  of  the  Box- 
Jenkins  model  has  an  "autoregressive-intergrated-moving 
average" (ARIMA)  form: 


(l-^B-^B 


. .4  BP)  ( 1 — B ) dG. 


where : 


= (l-e-B-e_B2-. . .-0  Bq) a (2.2) 

12  q t 


Gfc  if  d > 0 


[Gfc-y  if  d=0 


= the  series  mean 
= values  of  g at  time  t 
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= l,2,...,p  are  autoregressive  parameters  and 
appear  in  the  autoregressive  factor  in  the 
model. 

= l,2,...,q  are  moving  average  parmeters  and 
appear  in  the  moving  average  factor  in  the 
model . 
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The  model  is  based  on  the  assumption  that  the  series  Gfc 
is  stationary.  If  the  series  is  not  stationary,  then  its 
first,  second,  etc.  differences  are  investigated  for  sta- 

j A 

tionarity,  i.e.,  by  checking  (l-B)aG(t)  for  various 
values  of  d.  Shorter  notation  for  the  complete  model  is 
ARIMA  (p,d,q) , where  p refers  to  the  order  of  autoregres- 
sive factors,  d refers  to  the  order  of  differencing,  and 
q refers  to  the  order  of  the  moving  average  factors. 

Two  very  important  concepts,  stationarity  and  dif- 
ferencing, have  been  introduced  and  require  further 
explanation.  A process  is  considered  stationary  if  the 
joint  distribution  of  the  observations  does  not  vary  with 
t ime  ; 


p(gt gt+T}  = P(gt+m gt+x+m)  (2-3) 

where  p (•)  is  the  joint  probability  density  function.  If 
the  process  is  stationary,  the  expected  value  of  the  obser- 
vations, E(gfc)  does  not  vary  with  time,  i.e.,  E(gt)  = 

E(W 

Further,  the  covariance  between  any  two  observa- 
tions from  a stationary  process  depends  only  upon  the 
number  of  time  period  (t)  separating  the  two  observations, 
i.e.  , 


C(9t-<W  * C(9t+m'<W™>)- 


(2.4) 


In  reality,  many  time  series  are  not  stationary. 


Nonstationary  time  series  may  have  a mean  which  is  itself 


a function  of  time  (1:7,85,92).  However,  such  series 


often  exhibit  what  is  called  homogeneous  nonstationarity , 


where  the  behavior  of  the  series  is  similar  at  different 


points  in  time.  If  a series  is  stationary  in  a homogen- 


eous sense,  the  successive  changes,  or  differences,  of 


that  series  are  stationary  (6:56).  Thus,  a nonstationary 


time  series  can  often  be  analyzed  by  finding  a difference 


of  the  series  which  is  stationary  and  applying  stationary 


techniques  to  it  (6:56-58). 


Usually,  it  is  necessary  to  take  only  the  first  or 


second  differences  of  a series  to  achieve  stationarity 


(1:175).  The  first  difference  of  a series,  b , is  defined 


to  be  At (g) =gfc-gt_^ . The  second  differences  are  the  dif- 


ferences of  the  first  differences  and  are  given  by: 


At(g)  - At(g)-At_1(g)  - (gtst.1)-(gt.1-gt.2) 


* 9t~29t-i+9t-: 


The  computer  programs  for  building  the  Box  and 


Jenkins  models  for  the  analysis  and  forecasting  of  time 


series  are  available  on  the  CREATE  system  (see  Appendix  D) 


A computer  program  that  will  generate  data  files  from  ran- 


dom generators  or  accept  data  entered  from  the  terminal 
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to  be  read  into  the  Box  and  Jenkins  programs  has  been 
developed  and  is  included  in  Appendix  C. 

List  of  Assumptions 

1.  The  D041  forecasting  procedures  have  been 
unchanged  and  have  been  consistently  employed  during  the 
period  analyzed  by  this  research. 

2.  The  distribution  of  forecast  errors  follows 
a normal  distribution. 

3.  The  e (t)  terms,  for  a set  lead  time  r are 

T 

independent. 

4.  The  afc  terms  are  independent,  with  an  expected 

2 

value  of  0,  and  a constant  variance  a . 

a 

List  of  Limitations 

1.  The  results  of  this  research  will  be  limited 
to  the  forecasting  methods  (principally  time  series  models) 
applied  to  the  reparable  generations  of  items  which  follow 
the  particular  distribution  patterns  discussed. 

2.  The  guide  for  the  development  of  data  files 
for  input  to  the  Box  and  Jenkins  models  and  for  the 
estimation  and  identification  of  time  series  models  are 
applicable  only  to  the  computer  programs  on  the  CREATE 
system.  Slight  modifications  may  be  required  to  adapt 
the  programs  to  other  systems  and  computer  languages. 
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CHAPTER  III 
FINDINGS  AND  ANALYSIS 
The  D041  Forecasting  Method 

The  D041  system  uses  a single  moving  average 
forecasting  method  based  on  the  most  recent  eight  quar- 
ters of  data  as  its  basic  forecasting  method  (2) . In 
computing  the  OFM  Demand  Rate,  which  is  used  in  conjunc- 
tion with  future  programs  to  determine  projected  reparable 
generations,  the  D041  system  uses  a weighted  single 
moving  average  forecasting  method  (2) . The  weight  depends 
on  the  changes  in  the  program  and  OFM  rates . 

Consequently,  when  the  quarterly  programs  follow 
a decreasing  trend  (see  Figure  3.1)  the  currently  computed 
OFM  demand  rate  will  be  less  responsive  to  these  trends 
in  the  quarterly  rates  than  will  the  single  moving  aver- 
age of  the  quarterly  rates.  Also,  if  the  quarterly  OFM 
demand  rates  follow  an  increasing  trend  (Condition  1 
on  Figure  3.1),  the  computed  demand  rate  will  be  smaller 
than  the  single  moving  average  of  the  rates,  and  if  the 
quarterly  OFM  demand  rates  follow  a decreasing  trend 
(Condition  2 on  Figure  3.1),  the  computed  demand  rate 
will  be  larger  than  the  single  moving  average. 

Similarly,  when  the  quarterly  programs  follow  an 

increasing  trend  (see  Figure  3.2),  the  currently  computed 
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OFM  demand  rate  will  be  more  responsive  to  any  trends  in 
the  quarterly  rates  than  the  single  moving  average  of  the 
quarterly  rates.  Thus,  if  the  quarterly  OFM  demand  rates 
follow  an  increasing  trend  (Condition  3 on  Figure  3.2), 
the  computed  demand  rate  will  be  larger  than  the  single 
moving  average  of  the  rates,  and  if  the  quarterly  rates 
follow  a decreasing  trend  (Condition  4 on  Figure  3.3), 
the  computed  demand  rate  will  be  smaller  than  the  single 
moving  average  of  the  rates. 

In  the  situation  where  the  program  is  approxi- 
mately constant  over  time,  the  OFM  demand  rate  is 
essentially  computed  using  a single  moving  average. 

The  equation  for  a single  moving  average  method 
using  ”x”  for  the  length  of  the  base  period,  "t"  for  the 
period,  and  "G"  for  the  variable  would  be: 


t 


where : 

tg  = t-(x-l). 

The  equation  for  a weighted  single  moving  average 
using  "W"  for  the  weighted  value  would  be: 


WSMAt 


(3.2) 


where : 

tQ  = t-(x-l) 

The  D041  forecasting  method  was  investigated 
to  determine  if  the  forecasts  were  biased.  A single 
moving  average  forecasting  model  was  computerized 
(Appendix  A)  and  was  used  to  compute  the  bias  of  fore- 
casts using  five  computer-generated  data  patterns. 

The  forecast  was  initialized  on  the  first  24  data  points 
of  each  pattern  and  then  36  forecasts  were  made  incre- 
menting the  data  base  by  one  for  each  data  pattern 
and  for  each  of  the  13  different  lead  times.  This  method 
yielded  36  forecast  errors  for  each  lead  time  and  these 
forecast  errors  were  used  to  determine  the  bias  and 
statistical  significance  of  the  bias  for  each  data 
pattern.  Lead  times  of  1,  3,  6,  9,  12,  15,  18,  21,  24, 
27,  30,  33,  and  36  months  were  used  in  this  study. 

The  results  of  this  analysis  are  presented  below 
for  each  pattern  tested. 
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Poisson  Pattern 

As  expected,  forecasts  of  data  from  this  pattern 
were  found  to  be  statistically  unbiased  for  each  lead  time 
(refer  to  Tables  F.l  and  F.2). 

Linearly  Increasing 
Poisson  Pattern 

The  single  moving  average  forecasts  were  found  to 
be  statistically  biased  at  an  alpha  level  of  .01  for  lead 
times  of  6 months  of  greater.  For  the  lead  time  of  3 
months,  the  forecast  was  statistically  biased  at  an  alpha 
level  of  .05  and  at  a lead  time  of  1 month  was  statistic- 
ally biased  at  alpha  level  of  .10  (refer  to  Tables  F.3 
and  F. 4) . 

Linearly  Decreasing 
Poisson  Pattern 

The  results  were  essentially  the  same  as  those  of 
the  linearly  increasing  Poisson  pattern  except  the  fore- 
casts were  more  biased  because  a steeper  slope  was  used 
for  this  pattern.  The  forecasts  were  statistically 
biased  for  all  lead  times  (refer  to  Tables  F.5  and  F.6). 

Alternating  Linear  Increasing 
and  Decreasing  Poisson  Pattern 

The  single  moving  average  forecasts  were  found  to 
be  statistically  biased  at  lead  time  of  12  months  or 
greater  at  an  alpha  level  of  .01.  At  lead  times  of  one 
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to  9 months  the  forecasts  were  statistically  biased  at 
an  alpha  level  of  .10  (refer  to  Tables  F.7  and  F.8). 

Sine  Pattern 

The  sine  pattern  was  found  to  be  statistically 
unbiased  at  an  alpha  level  of  .01,  but  was  found  to  be 
biased  for  all  lead  times  at  an  alpha  level  of  .20 
(refer  to  Tables  F.9  and  F.10). 

Time  Series  Analysis  Forecasting  Results 

This  section  contains  the  results  from  the  time 
series  analysis  forecasting  techniques.  The  single 
moving  average  forecasting  method  was  shown  to  be  unbiased 
at  an  alpha  level  of  .01  for  the  Poisson  and  Sine  patterns 
and  consequently  time  series  analysis  forecasts  were  not 
made  for  these  data  patterns.  Time  series  analysis 
methods  were  used  to  make  forecasts  for  the  linearly 
increasing  Poisson,  linearly  decreasing  Poisson,  and 
alternating  linear  Poisson  patterns.  The  initial  24  data 
points  were  used  as  the  forecasting  base  for  each  data 
pattern.  From  that  base,  forecasts  were  made  96  time 
periods  into  the  future.  Thirty-six  forecast  errors  were 
computed  for  each  lead  time  and  the  same  statistical  tests 
used  in  determining  the  bias  of  the  single  moving  average 
forecasts  were  used  to  determine  the  bias  of  the  time 
series  analysis  forecasts.  The  time  series  analysis 


CHAPTER  IV 


CONCLUSIONS  AND  RECOMMENDATIONS 
Conclusions 

The  first  research  hypothesis  stated  that  the 
expected  value  of  the  distribution  of  forecast  errors 
presently  found  in  the  D041  system  was  not  zero,  i.e. , 
the  forecast  was  biased.  The  research  concluded  that 
reparable  generations  which  followed  a Poisson  process 
with  a constant  mean  or  one  in  which  the  mean  followed 
a sine  distribution  could  be  forecast  without  bias  by 
using  the  D041  single  moving  average  forecasting  method. 
The  research  concluded  that  the  single  moving  average 
forecasting  method  produced  biased  forecasts  of  reparable 
generations  of  recoverable  items  when  the  reparable  gener- 
ations followed  a linearly  increasing  Poisson,  a linearly 
decreasing  Poisson,  or  an  alternating  linear  Poisson 
pattern. 

The  second  research  hypothesis  stated  that  other 
predictive  methods  might  exhibit  less  bias  than  the  DO  41 
forecasting  method.  The  research  concluded  that  time 
series  analysis  techniques  could  be  used  to  produce 
unbiased  forecasts  for  linearly  increasing,  linearly 
decreasing  and  for  alternating  linear  Poisson  processes. 
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Recommendations 

Recommendations  for  further  research  include  the 
following : 

1.  Inasmuch  as  this  thesis  was  based  on  data 
generated  following  specific  random  processes,  future 
studies  should  be  accomplished  using  representative  actual 
data . 

2.  Since  a number  of  random  processes  were  investi- 
gated in  this  thesis,  a future  study  could  be  accomplished 
to  determine  if  particular  federal  stock  classes  of 
recoverable  items  can  be  identified  to  follow  any  of  these 
patterns . 

3.  An  investigation  of  different  error  criteria 
such  as  the  mean  absolute  error,  average  absolute  dif- 
ference, and/or  relative  forecast  error  to  determine  the 
bias  of  the  time  series  analysis  and  single  moving  average 
forecasting  methods  should  be  performed. 

4.  An  examination  should  be  made  of  the  negotia- 
tion process  to  determine  the  differences  between  fore- 
casted reparable  generations  and  the  number  of  units  nego- 
tiated to  be  put  in  repair  for  the  same  time  period. 

5.  An  examination  should  be  made  of  the  produc- 
tive process  with  emphasis  on  the  accuracy  of  the  methods 
used  in  forecasting  availability  of  manhours,  facility 
time,  and  spare  parts  in  the  maintenance  system.  A 
comparison  could  be  made  using  the  methodology  of  this 
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research  effort  to  determine  the  accuracy  by  comparing  the 
forecasted  availability  with  actual  availability.  If 
discrepancies  are  found,  other  forecasting  methodologies, 
particularly  time  series  anlaysis  should  be  investigated 
r further,  to  determine  if  better  forecasts  can  be  made. 

: 


APPENDIX  A 


FORECAST  BIAS 

The  program  contained  in  this  appendix  was  designed 
to  test  the  bias  of  the  single  moving  average  and  other 
forecasting  methods  such  as  the  time  series  analysis 
method.  The  program  requires  a minimum  of  60  data  obser- 
vations and  can  handle  up  to  200  data  observations. 

The  first  24  data  points  are  as  used  in  the  initial  fore- 
casting base  and  then  36  forecasts  are  computed  increment- 
ing the  forecasting  base  by  one  for  each  new  forecast. 

The  error  terms  can  then  be  computed  at  different  lead 
times  and  the  bias  can  be  determined  as  these  lead  times. 
The  program  also  computes  a statistic  which  can  be  compared 
with  t distribution  values  to  determine  the  alpha  level 
of  significance  of  the  bias.  Data  observations  can  be 
entered  directly  from  the  terminal  or  the  random  processes 
which  are  built  into  the  program  can  be  used  to  generate 
observations . 

It  should  be  noted  that  if  a lead  time  of  36  is 
being  investigated,  at  least  96  observations  would  have 
to  be  initially  entered  into  the  program.  The  program  uses 
the  first  24  as  the  initial  forecasting  base,  then  computes 
36  forecasts  which  requires  up  to  and  including  the  60th 


41 


42 


data  point,  and  then  finally  another  36  observations  are 
required  to  compute  36  forecast  errors  which  require  up 
to  and  including  the  96th  data  point.  If  insufficient 
data  points  are  entered,  the  program  will  use  zeroes  for 
the  required  remaining  data  points  and  the  results  will 
obviously  be  inaccurate. 

The  computer  program  was  written  in  the  FORTRAN  IV 
computer  language  and  can  be  run  as  follows : 

1.  The  program  asks  for  the  number  of  observa- 
tions. These  observations  can  be  entered  from  the  terminal 
or  from  the  built-in  random  processes. 

2.  The  program  asks  if  single  moving  average  fore- 
casts are  desired  or  if  the  bias  of  another  forecasting 
method  is  being  investigated.  If  single  moving  average 
forecasts  are  desired,  then  the  program  automatically 
computes  36  single  moving  average  forecasts  initializing 

on  the  first  24  data  points.  Lead  times  can  then  be 
entered  and  the  program  computes  ■‘■he  bias  and  test 
statistic  for  the  entered  lead  times. 

3.  If  testing  the  bias  of  another  forecasting 
method,  lead  times  can  be  entered  and  the  bias  and  test 
statistic  for  thase  lead  times  will  be  computed  by  the 
program. 

The  following  random  processes  were  built  into 
the  program: 


| 

11 


V 


43 

1.  A Poisson  process  with  a mean  of  ten. 

2.  A Poisson  process  with  a linearly  increasing 
mean.  The  mean  is  initially  set  at  10  and  then  is 
incremented  by  one-twelfth  for  each  newly  generated  data 
point . 

3.  A Poisson  process  with  a linearly  decreasing 
mean.  The  mean  is  initially  set  at  40  and  then  is 
decremented  by  one-sixth  for  each  newly  generated  data 
point. 

4.  A Poisson  process  with  an  alterating 
linear  mean.  The  mean  increases  for  the  first  12  data 
points  and  then  decreases  for  the  next  6 data  points  and 
then  continues  to  increase  and  decrease  in  the  same  manner 
to  the  120th  data  point  and  then  increases  from  this  point 
to  the  200th  data  point. 

5.  A process  whose  values  are  the  sine  function 
of  the  output  of  a Poisson  process. 

The  complete  computer  program  comprises  the 
remainder  of  this  appendix. 


I 

1 


wa« 

GO  O 
<lKh 

a o 

KlitN 


fa 

23  fa 

co 

fa 

O 03  23 

fa 

►H  < O 

o 

fa  = 

Eh  fa  fa 

hH 

<3  H 

“ 

D2IX: 

Ei 

25 

CO 

P i-h  w 2: 

< 

l-H  05 

25 

t-c  fa  O 

> 

S fa 

O 

05  05  i-h 

S3 

03  Eh 

KH 

Eh  O O Eh 

fa 

fa  23 

Eh 

CO  23  fa  fa 

CO 

CD  Eh  fa 

< 

HW  a 

a 

CO 

> 

P CO  CO  i-h 

a 

o 

S 03 

05 

< 05 

03 

fa  o o 

fa 

23  fa  W E H 

hH 

OS  fa 

X 03  eh 

CO 

O 05  23  CO 

*o 

• fa  <5 

a 

CO  O H H 

/*N 

• 

CO 

03 

o 

CO  fa  CO  Q 

03 

rH 

CO  05 

CO  < fa 

M P 

rH 

* 

H W 

CO  /-'Eh  2: 

w 

O 05  U 

• 

"-.fa 

'-'CD  «3J  W 

05 

fa  05  O H 

CO 

CM 

m s 

<COOG 

O Eh  ^ 

O fa  05 

rH  • 

rH 

hH  p 

Hf  fa 

05  fa  Eh 

•»  rH 

fa  23 

CO  H O S 

05  O 

O in  fa 

rH  •* 

hH 

P 

-X  23  O 

P fa  23 

fa  CO  s 

rH  • 

+ 

W 

/vX  HH  p 

Eh  Eh  - 

23  O 

« o 

o 

os  n 

o >o5  2 

5h  rH 

H 23  O fa 

O fa  rH 

fa  rH 

fa 

C Eh 

O /-Nfa  <3 

O fa  II 
C M 

o co  a 

03  O 

O 

o 

/-N 

(N  O Eh  05 

^>05  CO  CO  05 

*23  23 

fa  fa 

fa 

p o: 

wo  2: 

•* 

w in  w co  t-i  w 

fa  -O 

• o 

►H  w 

<NWS 

rH  ^ 

fa  h}*  Eh  H O fa 

•rH  C0  fa 

H CO  fa 

rH 

fa  H 

v-/  O 

• h:  : 

O 2 CO  23  O fa  X 

: CO  II  CO  p 

o 

II  CO  fa  O 

ii 

=3  25 

25  fa  fa  05  o : 

O’ 

03  hh  CM  fa  fa  05 

O * l-H  I-H  Z 

G}  H H 2 Oi  H 

wHfaOSHli, 

X fa  = <:  : 

H H H - Z Z Z 

= 05  rH  O *-H  rH 

O H H 

II  = 

O h h : r r r 

•*  * 

23  ^ * > > > 

w a,  fh 

CO  fa  Eh 

c- 

* «* 

23  CO  Eh  « - - > 

X X EH  *EH  eh 

O O Q Eh  Eh  Eh  fa 

Eh  -O  II  23  O 

II  fa  O 

Eh 

>2;  Eh  Eh  EH  Eh 

-X  23  P 23  23 

Eh  CJ  2 23  23  23  2 

25QEhO^>OEhO^OEhO 

P X (H  < I-H  HH 

Khhhh 

*-•  < a h u 

P I-H  U 

p 

Oh<IS<!hhhh 

<Jj'^05WC5  05OO  II  05  05  05  05  05  fa  O w 

o 

o 

C505fai-HWC5  05  05  05Wfafa05fafaO*i<XfafafafafaC30rH<!:cM 

o 

CO  < CD  O 

in 

% ft  K Q 

* 

o in  ® o 

rl  rl  H N 


ft  ft  Oj  ^ 

o 

o o o o o 

CO  t<-  00  03  H 


ooooooooooooo 

HMCO^intONOOfflOHNCI 

HrlrlrlHHHrlrlNNNN 


ooooooooo 

'untot'ooooHn 

CMCMCMCMCMCMCOCOCO 


V 


46 


■ 

) 


\ 


o 

in 

+ 


co 


co 


= o 

Eh 

co  w 
< co 
O 1-1 

ft  gs 

K ft 


ft 


O 

ft 

ft 

ft 


Eh 

O ft 
a ft 

< Eh 

ft  ft 

O 

S co 
O < 
co  o 

l-H  ft 

ft  ft 

< o 

ft  ft 


o 

o 

ft 

< 

o 

l-H 

&H 

m 


w 

o 

*sj 

ft 

ft 

> 

< 

a 

z 


Eh  l-H 


CO 

— . 

•% 

< 

> 

' — 

• 

/— N 

Eh 

o 

cm  oo 

o 

• 

CO 

a 

CM 

o 

o o 

rH 

o 

CO 

01 

rH  ft 

N*- 

o 

o 

ft 

ft 

1 

1 1 

z 

ft 

z 

p 

l-t 

l-H  W 

o 

i • 

l-H 

o 

o 

w 

v-/  ' — ■* 

CO 

o 

w 

z 

Eh 

+ 00 

1 ft  + 

CO 

ft 

ft  ft 

«i; 

ft 

w 

O O Hi*  on  ft  ft 

o 

^ o • 

a 

CO 

Q 

<5 

03 

CO  ft  CO  Z CO  o 

o 

z 

• z * 

O ft 

+ 

SH  • 

w «w  Z 

ft 

«k 

m -in 

ft 

< 

'H  + 

/•s 

Z ft  CO 

z ft  oi  z w 

i-H 

W ft  W 

l-H 

ft  CO  CO  ft 

CO 

ft 

O P o 

O ft  o o p rH 

z 

n 

eh  w ii  a w 

£ 

M 

ftft 

• CO  CM  ft 

ft 

CO 

CO  Z H 

CO  Z rH  CO  Z II 

l-H  H 

l-H 

Z p ft  ft  p 

•k 

Eh 

O’  - - 

< p 

- 

< p = 

CO  ft  II 

CO  ft  II  CO  l-H  O >-H 

CO  ft 

O 

o Z o ft  z o 

Eh 

l-H 

ft  ft  ft 

a z 

z 

ft 

a z 

H Eh  h 

H Eh  h h £h  Cl 

* z 

Cl  o- 

ftftClftftftClZ 

5= 

• II  II 

C0  ft 

l-H 

II 

co  ft  : 

O Z 

O Z O Z rH  Cl 

O H rl  H 

REHftNSHftft 

ft 

ft  ft  1-3 

II  Eh 

Eh 

l-H 

II  Eh  - 

ft  O M 

ft  O CO  ft  O 

IN  Eh 

ft  Z ft  Z 

ft 

- 

ft 

ft 

'"'Z 

Z 

■^Z  Eh 

II  O ft 

II  Oft  11  U O O 

II  Z 

o o 

II  O O O II  o o 

ft 

Eh 

• 

ft  O O 

ft  o 

o 

o 

ft  o z 

'-H  H 

-r-i  rH  /-H  Eh  O 

/-nO 

Eh  ft 

'■HU  Eh  ft  U Eh 

Z 

Q 

ft  ft  03 

V^U 

u 

CO 

w u ft 

M rH 

l-H  CM  M CO 

ft  o 

ft  ft 

ft 

WH 

< 

<c 

< ft 

w rH  O 

h— ' rH  O^rlQrl 

v-/ 

o CO 

vNOCI'HrlO 

Cl 

ft 

ft 

ft  Q O 

a o 

O 

o 

a o ft 

•3J  rH  Q 

<rlQ<HOH 

< Cl 

a ft 

< ft  O ft  < 03  U 

ft 

ft 

ft 

ft  a a 

C0  03 

ft 

a 

CO  CO 

O O O 

o o o o o o o 

o o o o 

oooooooo 

o 

o 

o o o 

o o 

o 

o 

o o o 

t>  00  01 

O H CM  CO  HJ1  in  CO 

Ch  00  Cl  o 

ftNcorfincoc-co 

Cl 

o 

ft  CM  CO 

o*  m 

CO 

t> 

00  Cl  o 

CO  CO  CO 

t>  Ch  O.C-  Ch  |>  t- 

t>  C~  Ch  00 

00  00  00  00  00  00  00  00 

00 

Cl 

Cl  Cl  Cl 

Cl  Cl 

01 

Cl 

Cl  Cl  ft 

* <WP.. 


1010  PRINT 


TP 

00 

Pi 

•k 

X 

CO 

“ 

O' 

CO 

• 

Eh 

00 

CO 

Pi 

< 

•k 

O = 

X 

w w 

CO 

OS  CO 

* 

o < 

•O' 

u*  a 

00 

w < 

p< 

O Eh 

•k 

< < 

X = 

« Q 

CO  K 

W 

^ k(H 

> Eh 

Tjl  hh 

< 2 

• Eh 

HH 

oo 

O O 

p<  g 

2 P 

- < 

M 

X w 

> Tp 

CO  P 

O CM 

a 

•o>  es 

< 

w 

'H  • pj 

CD  00  Eh 

J 2 

CO  p,  2 

o o 

- - PI 

2 

iH  X 

•h  Q 

II  CO 

CO  w 

t-H  * 

CO 

-TP 

CO  < 

• 

CO  ffl 

►H  00 

T-'  Pi 

a x 

C0  CM 

w w I 

OS  OS  CS  CS 
Oh  C«  C*  Pi 


-Eh 

= = O < 
O S 
= = <N  OS 
- - O 
Eh  Eh  Eh  fc, 
Z Z 2 

H W W O 

« OS  OS  O 
C,  0,  (0,  CM 


i co 

CO  <-h  CO  PI  CO  CO 

njwnxw  "-co 
> - ^ p -+  p a - 

: : : HCZHazprH 

ii  a h ii  a h w ii 
O : : : jratjJWhXJ 

o ii  o o o - - - pa  ii  2 x 2 ii 

II  2 II  n II  Eh  Eh  Eh  -0^0  0 II  OOSO 
aX»pEH222Q®  JONSOX00 
DWngcOHHH^  ^ P W 

GOXEhWQSOSOSWOWOOWOO 

X<tOWtHftC.ftCiOXCOfl«C'<Q 


1300  SXIS=SXIS+(XE(L)**2) 

1310  80  CONTINUE 

1320  STND=( (SXIS-(36*(AVEXR**2) ) )/35)**0 .50 
1330  TEST= ( AVEXR*6 ) / STND 
1340  PRINT,"  " 


c 

> 55 

W W 

° H 

i § 

ra  P* 


c 

m 

• co 

o < 

* Hi 

* m 


CO 

2 Ei 

CO 

2 CO  W 2 O 

< 

• 

CQ 

W X Eh  O 

o 

1-1  0 

s-/ 

CQ 

> 2 “O  O 

w 

If  <fi 

< 

* 

< P O 2 2 r-f 

2 

HH  W 

1 

* 

X 2 2 W < 

o 

«P 

✓“N 

/-N 

'-'Eh  2 > O 

pH 

/-N 

iH  ^ 

HH 

* CO  w < Eh  Eh 

H-l  2 

1 HH 

CD  X X 2 

CQ  w 

w 

co  “■ — “<  O 

W Eh 

a w 

>1 

s s=  a 

S 2 

+ X 

CO  X 

1 CD  P 

hh  w 

CD  hh  CO  X 

CO  CO  W 

W * CO  P rH 

Eh  = 

n^Hn  + w 

-^>co  + w hh  os  x o 

S-X  » 

“Wp  “5,  p 

s -co  p 

x w xx  cr 

“Eh 

rlSZrlDZ 

D rl  H 2 

CQ  > “ 2 W 

= 02 

II  H h ||  COhhCO  II  X h X < O Ph  2 • 

O hh 

'-'X  Ohz2 

- - 

= (Ncsoooooq 

II  2 X 2 X X 2 

'^'-'  = co  = 2 2 

- - - H II  II  II  II  I. 

HEhEhEh  skmqh 
2222OP«hhPG0 
hhhhOM>XHH 


ii  no 

E-i  “ O 
CO  Q CO 

w < 


2222ox<cocoehwoxoo 

ftftc.o.H^xxxisoxoa 


ii  O ii  o 
sono 
^ w 00 

CO  o > 

x o < o 

XNXD 


E-i  E-i  E-i  “2 

2 2 2 q 

H H H <j 

2 2 2 w 2 

ft  ft  A 2 Hi 


1970  770  STOP 
1980  END 


APPENDIX  B 


MATHEMATICAL  DESCRIPTION  OF  TIME 
SERIES  ANALYSIS  MODELS 

In  Chapter  II  a time  series  was  dj2ff.ned  to  be  a 
set  of  observations  taken  at  equally  spaced  time  inter- 
vals, e.g.,  gfc,  t=l,2,...,n.  A backshift  operatior  B 
was  defined  by  the  relationship  B9t=<3t_^  an<^  ^^t^t-m” 

A forward  shift  operator  F can  be  defined  as  the  inverse 
of  B (F=B_1),  e.g.,  Fgt=gt+1  and  Fmgt=gt+m. 

Another  operator  introduced  in  Chapter  II  was 
the  backward  difference  operator  A where 

Agt  = g^t.-L  = (1-B)gt  . (B.l) 

White  noise,  denoted  as  afc,  was  defined  as  a series  of 
independent  random  shocks.  A general  stochastic  process, 
g(t),  can  be  modeled  as  a linear  combination  of  these 
random  shocks.  This  combination  is  often  called  a linear 
filter  and  is  described  pictorially  below: 

gt  (B . 2 ) 


The  linear  filtering  operation  simply  takes  a weighted 
sum  of  previous  observations  so  that: 


(KB) 


Linear  Filter 
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g (t)  = y+at+i|;1at_1+ij>2at_2+.  . . (B.3) 

= y+4>  (B)  afc 


In  general,  y is  a parameter  that  determines  the  level  of 

2 

the  process  and  ip  (B)  =l+^B+t|i2B  + ...  is  the  linear  opera- 
tor which  transforms  the  sequence  a^,a^_^,...,  into 
and  is  called  the  transfer  function  of  the  filter. 

Autoregression  Models 

A model  which  can  be  extremely  useful  in  the 
representation  of  certain  stochastic  processes  is  the 
autoregressive  model.  In  this  model,  the  current  value 
of  the  process  is  expressed  as  a finite,  linear  combina- 
tion of  previous  values  of  the  process  and  a shock  afc. 
Denoting  the  values  of  a process  by  gfc ,gfc_^ ,gfc_2 , . . . and 
letting  , §t_2 > • • * be  deviations  of  g from  y (i.e., 

§t=gt-y),  then: 


§ 


t=*iW 


2gt-2+ 


g.  +a. 
P t-p  t 


(B . 4 ) 


is  called  an  autoregressive  (AR)  process  of  order  p.  The 

A 

reason  for  this  name  is  that  a linear  model  g=<J>^X^+ 

A A 

^2X2+  ' * * + <*>pXp+a  relating  a dependent  variable  g to  a 
set  of  independent  variables,  X^,X2>...,X  plus  an  error 
a is  referred  to  as  a regression  model,  that  is  g is  said 
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to  be  regressed  on  X]/X2' * ' ' ,Xp*  In  variable  g 

is  regressed  on  previous  values  of  itself;  hence  the  model 
is  autoregressive.  An  autoregressive  operator  of  order  p 
can  be  defined  as: 

♦ (B)  = l-^B-^B2-  ...  -<t>pBP  . (B . 5 ) 

The  autoregressive  model  may  then  be  written  concisely  as: 

♦(B)g=at.  The  model  contains  p+2  unknown  parameters, 

2 

U , ♦-  , ♦_  , . . . , <j)  ,0  which,  in  practice,  have  to  be  esti- 

A p 3. 

2 

mated  from  the  data  and  a is  the  variance  of  the  white 

a 

noise  afc. 

The  autoregressive  model  is  a special  case  of  the 
linear  filter  model  (B.4).  For  example,  eliminate 
from  the  right  hand  side  of  (B.4)  by  substituting 

§t_1=<J>1§t_2+<f)2§t-3+  •**  +<f,p§t_p_i+at-i*  likewise  substi- 
tute for  g 2,  anc^  so  on'  yielding  an  infinite  series  in 
the  a's.  Symbolically,  <J>(B)g^=at,  which  is  equivalent  to 
gt=ip(B)at  with  (B ) =<J>  ^(B).  An  autoregressive  process 
may  or  may  not  be  stationary.  For  the  process  to  be 
stationary,  the  ♦ ' s must  be  chosen  so  that  the  weights 
^l'^2'*’’  t*ie  ^ (B)  form  a convergent  series. 

Moving  Average  Models 

The  autoregressive  model  (B.4)  expresses  the 
deviation  on  the  process  g as  a finite  weighted  sum  of  p 
previous  deviations  ' '*^t-p 


plus  a random  shock 
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afc.  Equivalently  it  expresses  gfc  as  an  infinite  weighted 
sum  of  a's.  Another  kind  of  model  with  great  practical 
importance  in  representing  observed  time  series  is  the 
finite  moving  average  process  (1:10).  In  this  model  gfc  is 
linearly  dependent  on  a finite  number  (q)  of  previous 
a's.  Thus : 


g,  = a -0,a.  -0_a^_  - ...  -0  a^ 

t t 1 t-1  2 t-2  q t-q 


(B . 6 ) 


is  called  a moving  average  (MA)  process  of  order  q.  The 
name  "moving  average"  is  somewhat  misleading  since  the 
weights  1 , -9^ , “62“ • • • need  not  total  unity  nor  need 

they  be  positive.  If  a moving  average  operator  of  order 
q is  defined  as  9 (B) =1-0^B-02B2-  ...  -0^B^,  then  the  moving 
average  model  can  be  written  concisely  as : 


gfc  = 0 (B)  afc  , 


which  contains  q+2  unknown  parameters  , cr2 

X C£  cl 

which,  in  practice,  are  estimated  from  the  data. 


Mixed  Autoregressive-Moving 
Average  Models 

In  order  to  achieve  greater  flexibility  in  the  fit- 
ting of  actual  time  series,  it  is  advantageous  to  include 
both  autoregressive  and  moving  average  terms  in  the  model. 


l i ' < 

ft 
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This  leads  to  the  general  mixed  autogressive  moving  aver- 
age (ARMA)  model: 

gt  = *lgt-l+  **•  +Vt-P+at*6iat-l‘  0qat-q 

or  (B. 7) 

<>(B)gt  * 0(B)at 

which  employs  p+q+2  unknown  parameters,  y , , . . . , <(>  , 

2 

0 .,...,  0 ,o  ' which  are  estimated  from  the  data. 

X C[  cl 

Nonstationary  Models 

Many  time  series  encountered  in  practice  exhibit 
nonstationary  behavior,  i.e.,  they  do  not  vary  around  a 
fixed  mean  (1:11).  Such  series  may  nevertheless  exhibit 
homogeneous  behavior  of  a kind.  In  particular,  although 
the  general  level  about  which  fluctuations  are  occurring 
may  be  different  at  different  times,  the  broad  behavior 
of  the  series,  when  differences  in  level  are  allowed  for, 
may  be  similar.  Such  behavior  may  be  represented  by  a 
generalized  autoregressive  operator  Q(B),  in  which  one  or 
more  of  the  zeros  of  the  polynomial  Q(B),  i.e.,  one  or 
more  of  the  roots  of  the  equation  Q(B)=0,  is  unity. 

Thus,  the  operator  Q(B)  can  be  written:  Q (B)  =<p  (B)  (1-B)  a , 

where  <J>  (B ) is  the  stationary  operator.  The  general  model. 
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which  can  represent  homogeneous  nonstationary  behavior 
is  of  the  form: 


Q(B)gt  = <J>(B)(1-B)dgt  = 6(B)at 


that  is 


ij)  (B)  wfc  = 0 (B)  afc 


(B . 8 ) 


where 


w 


t 


(B . 9 ) 


Homogeneous  nonstationary  behavior  can  therefore  be 
represented  by  a model  which  calls  for  the  d'th  dif- 
ference of  the  process  to  be  stationary.  In  practice  d 
is  usually  0,1/  or  2 (1:11).  The  process,  described  by 
(B . 8 ) and  (B.9),  provides  a powerful  model  for  describing 
stationary  and  nonstationary  time  series  and  is  called  an 
autoregressive  integrated  moving  average  (ARIMA)  process 
of  order  (p,d,q) . The  process  is  defined  by: 


wt  = Vt-1+* 


.+<)>  w.  +a.-01a.  a. 

p t-p  t 1 t-1  q t-q 


(B.10) 


If  wfc  is  replaced  by  gt_U/  when  d=0,  the  model  (B.10) 
includes  the  stationary  mixed  model  (B.7)  as  a special 


case  and  the  pure  moving  average  model  (B.6).  The  word 
integrated  is  included  in  the  ARIMA  title  because  the 
relationship  which  is  the  inverse  to  (B.9)  is:  zt=Sdwt, 

where  S is  the  summation  operator  defined  by: 


Sw^  = 


oo 

E“t-b 


Wt+Wt-l+”t-2+' 


b=0 


Thus,  the  general  ARIMA  process  may  be  generated  from  white 
noise  afc  by  means  of  three  filtering  processes  as  follows: 


The  first  filter  has  input  afc,  transfer  function  9(B)  and 
output  efc,  where: 


et  = 


. -0  a. 
q t-q 


(B. 11) 


= 9(B)at 


The  second  filter  has  input  efc,  transfer  function  4>-1(B) 
and  output  wfc,  where: 


. . V 


. t ■ 
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w.  = <(> ,wt  ,+...+  w^  +et 
t Y1  t-1  p t-p  t 


= 4>-1(B)e. 


Finally,  the  third  filter  has  input  wfc  and  output  gfc 
according  to  (B.10)  and  has  transfer  function  Sd.  For  a 
detailed  discussion  of  a special  form  of  the  model  to 
employ  representing  seasonal  time  series  see  Box  and 
Jenkins,  Chapter  9. 


Transfer  Function  Models 

An  important  type  of  dynamic  relationship  between 
a continuous  input  and  a continuous  output  is  that  in 
which  deviations  of  input  X and  output  Y,  from  appropriate 
mean  values,  are  related  by  a linear  differential  equation 
of  the  form: 


(1+E1D+. . .+SRDR)Y(t)  = (HQ+H1D+. . .+HgDS) X (t-x)  (B.13) 


where  D is  the  differential  operator  d/dt,  the  E's  and 
Hs  are  unknown  parameters,  and  t is  a parameter  which 
measures  the  lead  time  or  pure  delay  between  input  and 
output.  A simple  example  of  (B.13)  would  be  a system 
where  the  rate  of  change  in  the  output  was  proportional 
to  the  difference  between  input  and  output  so  that: 
EdY/dt=X-Y  and  hence  (1+ED)Y=X.  Similarly,  for  discrete 


v i L18* 
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■1 


data,  the  transfer  function  between  an  output  Y and  an 
input  X,  each  measured  at  equispaced  times,  is  given  by 
the  difference  equation: 


(l+s _A+..  . + £ Ar)  Y.  = (n„+n,  A+.  . . + n AS)X_  . (B.14) 

l r t 0 1 s t-b 


in  which  the  differential  operator  D is  replaced  by  the 
difference  operator  A.  An  expression  of  the  form  (B.14) 
containing  only  a few  parameters  (r<2,  s<2) , may  often  be 
used  as  an  approximation  to  a dynamic  relationship,  whose 
true  nature  is  more  complex. 

The  linear  model  (B.14)  may  be  written  equivalently 
in  terms  of  past  values  of  the  input  and  output  by  substi- 
tuting B=l-A  in  (B.14),  that  is: 

(l-51B-...-6rBr)Yt  = (w0-wlB-...-wsBS)Xt_b 


(B. 15) 


= (w0Bb-WlBb+1-. . .-wsBb+S)Xt 


or 


5(B)Yt  = w(B)B  Xfc  = n(B)Xfc 


Alternatively  stated,  output  Yfc  and  input  X are  linked  by 
a linear  filter: 


4 


60 


Yt  ~ Y0Xt+YlXt-l+Y2Xt-2+' 


= Y (B)  X. 


for  which  the  transfer  function 


y(b)  = y0+y1b+y2b  +. . . 


can  be  expressed  as  a ratio  of  two  polynomials: 


(B. 17) 


Y(B)  = JJ(B)/6(B)  = 6_x(B)n(B) 


The  linear  filter  is  said  to  be  stable  if  (B.17)  converges 
for  | B | <1 . The  series  of  weights  Yq  ' ' ^2 ' ' ’ * ' 

appear  in  the  transfer  function  (B.17)  is  called  the 
inpulse  response  function.  Note  that  for  the  model  (B.14), 
the  first  b weights  Yq ' Y1 ' ' ' ' Yb-1  are  zero* 

The  transfer  function  (B.15)  enables  a reinterpre- 
tation of  the  stochastic  models  (B.6)  and  (B.7).  The 
disturbance  occurring  in  some  output  g will  often  have 
originated  elsewhere  in  some  variable  with  which  g is 
dynamically  linked  by  an  equation  of  the  form  (B.14) 

(1:14).  It  might  be  expected  that  the  complex  stochastic 
behavior  of  a random  variable  gt  might  be  expressed  in 
terms  of  another  random  variable  afc,  having  simpler 
properties,  by  a relationship 


* 
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6 (B)g  = fl(B)afc.  (B . 18 ) 

The  stochastic  models  previously  considered  are  of  this 
kind,  with  afc  a source  of  white  noise.  Since  (B.18) 
may  be  written: 


gt  = Q-1  (B)  9 (B)  afc  , 

it  is  assumed  that  g could  be  generated  by  passing  white 
noise  through  a linear  filter  with  a transfer  function 
Q-1 (B) 0 (B) . 

In  summary: 

1.  A dynamic  relationship  connecting  an  output  Y 
and  input  X in  terms  of  a linear  filter  is  given  by: 


Yt  = 


Y0Xt+YlXt-l+Y2Xt-2+‘  * • 


= Y(B)Xfc 

where  y (B)  is  the  transfer  function  of  the  filter. 

2.  In  turn,  y(B)  can  frequently  be  represented 
with  brevity  and  with  sufficient  accuracy  by  a ratio  of 
two  polynomials  of  low  degree  in  B:  y(B)=6  '*'(B)n(B)  so 
that  the  dynamic  input-output  equation  may  be  written 

6(B)Y  = fl(B)X  . 


3.  It  is  postulated  that  a series  gt,  in  which 
successive  values  are  highly  dependent  can  be  represented 
by  passing  white  noise  afc  through  such  a dynamic  system 
in  which  certain  of  the  roots  of  6(B)=0  are  unity.  This 
notion  yields  the  autoregressive  integrated  moving  average 
model : 


Q(B)gt  - 0(B)at. 

For  further  mathematical  detail  about  the  models , see 
Box  and  Jenkins,  particularly  Chapters  3,  4,  and  9. 


APPENDIX  C 


BUILDING  FILES  ON  THE  CREATE  SYSTEM 

The  program  contained  in  this  appendix  was  designed 
to  build  files  that  can  be  used  by  the  AFIT.LIB  Time 
Series  Analysis  programs.  Data  may  be  entered  into  the 
program  directly  from  the  terminal  or  may  be  generated 
from  random  number  generators  which  are  built  into  the 
program.  The  file  may  be  created  in  either  the  format 
that  is  required  for  time  series  analysis  model  identifi- 
cation (AFIT . LIB/UNIDEN , R)  or  for  time  series  analysis 
model  parameter  estimation  and  forecasting  (AFIT.LIB/ 

UNEST , R) . Detailed  instructions  for  using  the  program 
are  contained  in  Appendix  E. 
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APPENDIX  D 


RUNNING  THE  TIME  SERIES  ANALYSIS 
PROGRAMS  ON  CREATE 

This  appendix  contains  examples  of  the  control 
program  used  to  run  the  Box  and  Jenkins  computer  programs, 
the  example  data  decks,  and  comment  programs  on  the 
CREATE  computer  system. 
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This  appendix  is  designed  to  assist  in  the  use 
of  the  computer  programs  on  the  CREATE  system  that  con- 
stitute the  Time  Series  Analysis  package.  These  programs 
are  in  the  AFIT.LIB  and  are  run  in  the  CARDIN  system. 

This  appendix  is  divided  into  five  sections:  (1)  build- 

ing input  files;  (2)  the  use  of  AFIT.LIB/UNIDEN,R  for 
model  identification;  (3)  guides  to  model  identification; 

(4)  estimation  of  parameters;  and  (5)  the  use  of  AFIT.LIB/ 
UNEST , R for  parameter  estimation  and  forecasting. 

Building  Input  Files 

A great  deal  of  difficulty  was  initially  encountered 
in  attempting  to  build  files  that  could  be  read  by  the  Box 
and  Jenkins  time  series  analysis  programs.  The  difficulty 
was  overcome  by  examining  the  main  programs  to  determine 
data  format  and  placement  requirements  and  then  by  build- 
ing a computer  program  to  create  files  in  the  required 
format.  The  terms  "create  a file"  were  abbreviated  to 
the  computer  program's  name  CAFILE  which  can  be  found 
under  SL . LIB/ CAFILE , R.  The  file  read  by  the  time  series 
analysis  programs  contains  both  parameters  and  data  ele- 
ments. The  data  elements  can  be  entered  in  two  ways  in 

100 


i 


101 


the  CAFILE  program:  (1)  directly  from  the  terminal,  or 

(2)  by  using  a random  generator.  The  parameters  in  the 
file  are  dependent  upon  which  time  series  computer  program 


the  file  is  being  built  for.  Many  of  the  parameters  are 
the  same  from  program  to  program  but  are  placed  differ- 
ently in  CAFILE  for  each  program. 

CAFILE  can  be  used  to  build  files  for  input  into 
AFIT. LIB/UNIDEN, R and  AFIT.LIB/UNEST,R.  A series  of  ques- 
tions in  the  program  insure  the  correct  placement  of  the 
parameters  in  the  file.  The  first  few  times  in  using 
CAFILE,  it  is  recommended  that  assistance  be  requested 
when  the  program  asks  "Dc.  you  want  instructions  on  file 
building?"  Explanations  of  such  terms  as  TLAM,  TM, 

NDIFAC , NAPI,  NAC,  NPAC , MCSE , ILPRINT , ND , IOD,  NCHI, 
etc. , are  presented.  After  gaining  famiiiarlity  with  time 
series  analysis  parameters,  the  shortened  version  may 
be  used  to  build  a file  in  a few  minutes. 

The  CAFILE  program  is  designed  to  handle  500 
data  elements,  the  capacity  of  the  time  series  analysis 
programs . 

Before  using  CAFILE,  it  is  necessary  to  create 
a file  name  with  enough  space  to  put  the  output  of  CAFILE 
into.  This  can  be  easily  accomplished  using  the  ACCESS 
system  to  create  file  space  on  disk  storage.  If  this  is 
not  accomplished,  the  program  will  "bomb." 


V 


The  CAFILE  program  runs  in  the  following  manner: 

1.  The  program  asks  if  this  is  your  first  experi- 
ence with  time  series  analysis. 

2.  The  program  asks  if  you  are  building  a file 
for  forecasting.  If  the  answer  is  "yes,"  then  the  file 
will  be  built  for  input  into  AFIT/UNEST,R.  If  the 
answer  is  "no,"  then  a file  will  be  built  for  input 
into  AFIT/UNIDEN , R . 

3.  The  program  asks  for  the  number  of  observa- 
tions and  what  format  is  desired.  If  a random  generator 
is  used  to  create  data  elements,  the  most  desirable 
format  is  (v) . 

4.  The  program  asks  for  the  file  name  where  the 
data  will  be  written.  As  mentioned  earlier  if  the  file 
is  not  available,  the  program  will  "bomb." 

5.  The  program  asks  for  the  values  of  the  spe- 
cific parameters  required  by  the  time  series  program. 

6.  The  program  asks  if  data  will  be  entered  from 
the  terminal  or  from  a random  generator.  If  a random 
generator  is  asked  for,  the  following  choices  are  avail- 
able : 

a.  A Poisson  process  generator  with  a mean 

of  ten. 


b.  A linearly  increasing  Poisson  process 
generator.  The  mean  is  initially  set  at  10  and  then  is 
incremented  by  one-twelfth  for  each  newly  generated  data 


element. 
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c.  A linearly  decreasing  Poisson  process 
generator.  The  mean  is  initially  set  at  40  and  then  is 
decremented  by  one-sixth  for  each  newly  generated  data 
element. 

d.  An  alternating  linear  Poisson  process 
generator.  The  mean  increases  for  the  first  twelve  data 
elements  and  then  decreases  for  the  next  six  data  ele- 
ments and  then  continues  to  increase  and  decrease  in 
the  same  manner  to  the  120th  data  element,  thereafter 
continuing  to  increase  with  each  successive  data  element. 

e.  A sine  process  generator.  The  equation 

is:  20Sine  (Poisson) +50 . 

7.  After  generating  the  data  elements  or  receiv- 
ing them  from  the  terminal,  the  program  will  ask  if  a 
forecasting  file  is  being  built.  If  the  answer  is 
"yes,"  the  file  will  be  written  to  the  previously  desig- 
nated file  in  the  proper  format  for  input  into  AFIT/UNEST , R. 
If  the  answer  is  "no,"  the  file  will  be  written  to  the 
previously  designated  file  in  the  proper  format  for  input 
into  AFIT/UNIDEN , R. 

8.  The  program  will  then  ask  if  you  are  building 

another  file.  If  the  answer  is  "yes,"  the  program  returns 
to  the  beginning.  A word  of  caution:  should  a file  using 

a random  generator  need  to  be  replicated,  it  must  be  done 
exactly  the  same  way  as  the  original . The  easiest  way  to 


insure  replicability  is  to  return  to  the  * level  and 
the  program  again  (see  the  program  contained  in  Appen 
dix  C)  . 


The  Use  of  AFIT.LIB/UNIDEN,R 


for  Model  Identification 


The  dimension  limitations  of  the  UNIDEN  program 


500  Observations 

150  Autocorrelations 

150  Partial  Autocorrelations 

2 Autocorrelation  functions  of  differences 
of  the  original  series. 

The  variables  in  the  program  which  must  be  sup- 
plied by  the  user  are: 

FMZ  The  format  specification  (max  size  =20). 

ILDID  The  data  list.  Set  = 0 to  suppress  data. 

IOD  An  array  containing  NDIFAC  orders  of  dif- 
ferences (d)  of  each  type  desired,  i.e., 
the  values  of  s in  the  expression  (1-BS) . 

IPDID  The  plotting  data  function  in  the  program 
Set  = 0 to  suppress. 

IWTPA  The  autocorrelation  plotting  function 
of  the  program.  Set  = 0 to  suppress. 

MCSE  The  standard  error  of  autocorrelation 
calculating  function.  Set  = 0 to  sup- 
press calculation. 


a • 
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MPRINT  A function  for  output  of  the  statistics 
in  the  program.  Set  = 0 to  suppress 
listing. 

NAC  The  number  of  autocorrelations. 

NAPL  The  number  of  autocorrelations  to  be 
printed  per  line,  between  1 and  12 
inclusive. 

NCHI  The  number  of  autocorrelations  to  be 
used  in  calculating  a chi-square  sta- 
tistic. Set  = 0 if  not  desired. 
Obviously,  the  maximum  dimension  of 
NCHI  is  NAC. 

ND  An  array  containing  the  NDIFAC  numbers  of 
differences  of  each  type  desired.  The 
autocorrelation  function  will  be  cal- 
culated for  the  ND  (1 ) +1  series. 

ND (2) , NC ( 3) , etc.,  are  used  to  form 
another  "original"  series  from  the  data 
matrix  entered  in  G . The  minimum  size 
is  the  value  of  NADIFAL.  Warning:  Some 

graph  titles  will  not  he  right  if  ND(1) 
exceeds  five. 

NDIFAC  The  number  of  difference  factors  or 
types.  The  autocorrelation  function 
is  calculated  for  the  original  series  and 
each  requested  difference  of  type  1.  Any 
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difference  factors  beyond  one  are  used  to 
difference  the  original  series  to  form  a 
new  "original"  series. 

NOB  The  number  of  observations. 

NPAC  The  number  of  partial  autocorrelations. 

The  maximum  value  is  NAC. 

SERIES  An  array  of  80  spaces  to  place  the  title 
describing  the  data  or  the  analysis. 

TLAM  A data  transformation  parameter.  If 

TLAM  is  set  equal  to  zero,  the  original 
data  elements  will  be  transformed  into  a 
series  with  ln(g^+TM),  i=l , 2 , . . . ,NOB  as 
the  data  elements.  If  TLAM  is  set  equal 
to  one,  the  data  elements  are  left  as 
input.  Any  values  greater  than  one, 
the  series  will  be  transformed  as  follows: 
(gi+TM)TLAM,  i=l , 2 , . . . ,NOB. 

TM  A data  transformation  parameter  which 
is  added  to  each  data  element  before 
it  is  transformed  in  accordance  with  TLAM. 

In  using  CAFILE,  much  of  the  above  information 
is  repeated  with  each  parameter  if  assistance  is  requested 
when  the  program  asks,  "Do  you  want  instructions  on  file 
building?"  Output  of  UNIDEN  of  the  three  series  is  con- 
tained in  Tables  E.l  to  E.9  and  Figures  E.l  to  E.15. 
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Fig.  E.9.  PACF  of  Observed  Linearly  Decreasing  Poisson  Series 
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Guides  to  Model  Identification 

Model  identification  was  the  most  difficult  time 
series  analysis  task  in  the  research  effort.  Although 
several  references  on  time  series  analysis  are  available, 
they  proved  to  be  of  little  value.  Most  of  the  references 
made  little  effort  to  fill  the  gap  between  descriptions 
of  various  models  and  the  forecasting  by  the  model  once 
it  had  been  identified.  The  remainder  of  this  section 
will  be  devoted  to  model  identification.  It  is  a 
compilation  of  the  experience  gained  in  this  research 
effort. 

The  autocorrelation  and  partial  autocorrelation 
functions  are  very  valuable  tools  in  the  identification 
of  possible  models.  The  Tables  E.10  and  E.ll  and  Fig- 
ures E . 16  to  E . 25  are  offered  as  assistance  in  identifying 
possible  models. 

Model  identification  and  estimation  overlap 
(1:173).  Employment  of  estimation  procedures  was  used 
to  carry  out  part  of  the  identification.  Identification 
is  necessarily  somewhat  inexact  since  a model  is  being 
built  to  fit  the  data.  It  is  in  this  stage  that 
graphical  methods  and  judgement  coming  from  experience 
are  particularly  useful.  It  should  be  noted  that  pre- 
liminary identification  does  nothing  but  tentatively 
identify  a class  of  models  which  can  later  be  more 
closely  fitted  to  the  data  and  checked  for  consistency. 
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TABLE  E . 10 


BEHAVIOR  OF  AUTOCORRELATION  FUNCTIONS 
WITH  PARAMETER 


Model 

Behavior  of  pk 

AR 

(l,d, 0) 

Decays  exponentially  (see  Figure  E.16) 

MA 

(0 ,d, 1) 

Only  p^  is  nonzero  (see  Figure  E.17) 

1 / 

AR 

(2 ,d, 0) 

Mixture  of  exponentials  of  damped  sine 
wave  (see  Figure  E.  18) 

f i 

MA 

(0 ,d, 2) 

Only  p1  and  p2  are  nonzero  (Figure  E.19) 

/ 

t # 
f ** 

ARIMA ( 1 , d , 1) 

Decays  exponentially  from  first  lag 
(see  Figure  E.20) 

N 

\ 

TABLE  E.ll 

( ' 

BEHAVIOR  OF  AUTOCORRELATION  FUNCTIONS 
WITH  PARAMETER  4>kk"  - 

L 

Model 

Behavior  of  <j>kk 

AR 

(l,d,0) 

Only  is  nonzero  (see  Figure  E.21) 

MA 

(0 ,d, 1) 

Exponential  decay  (tail  off)  (see 
Figure  E.22) 

AR 

(2 ,d, 0) 

Only  <f)i:L  and  4>22  are  nonzero  (see 
Figure  E.25) 

MA 

(0 ,d, 2) 

Designated  by  mixture  of  exponentials 
damped  sine  wave  (see  Figure  E.24) 

ARIMA 

(1 ,d, 1) 

Dominated  by  exponential  decay  from 
first  lag  (4>11  ) (see  Figure  E.25) 

142 


Fig.  E.25.  Partial  Autocorrelogram  of  A ARIMA  (l,d,l) 

Process 
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P'1 


The  estimated  autocorrelation  function  of  the  increasing 
linear  Poisson  process  (see  Figure  E.  2)  tends  to  die  out 
quickly,  and  this  indicates  that  the  degree  of  dif- 
ferencing is  zero  (i.e.,  d=0)  . Since  both  the  autocor- 
relation function  and  partial  autocorrelation  function  had 
an  exponential  appearance  the  partial  autocorrelation 
function  tailed  off  (see  Figures  E.2  and  E.4),  a mixed 
autoregressive-moving  average  process  of  the  form  ARIMA 
(1,0,1)  was  selected  as  the  model.  Similarly,  in  the 
examination  of  the  correlegrams  for  the  linearly  decreas- 
ing and  alternating  linear  Poisson  patterns  (see  Fig- 
ures E.7,  E.9,  E.12  and  E.14),  the  same  exponential  appear- 
ance with  dampened  sine  patterns  was  observed  in  the 
autocorrelation  function  with  a cutoff  of  the  partial 
autocorrelation  function  after  the  first  lag.  The  ARIMA 
(1,0,1)  form  was  again  suggested. 

Once  a model  has  been  tentatively  selected,  the 
next  step  is  to  estimate  the  parameters. 

Estimation  of  Parameters 

Initial  Estimates  for  Autoregressive  Processes. 

For  an  assumed  AR  process  of  order  1 or  2 , initial  esti- 
mates for  the  parameters  <f>^  and  <t> ^ can  be  calculated  by 
substituting  estimates  r^  for  the  theoretical  autocorrela- 
tions p,  . For  an  AR(1)  : 

D 
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11 


r 


1 


where  is  the  first  autocorrelation  value  in  the  out- 
put from  UNIDEN.  For  an  AR(2) : 


21 


rl  (1-r2^ 


and 


(E . 1 ) 


where  <f>  . denotes  the  estimate  of  the  b 
Ypb 


th 


autoregressive 
parameter  in  an  autoregressive  process  of  order  p,  and 


r2  is  the  second  autocorrelation  value  in  the  output 
from  UNIDEN  (see  Tables  E.2,  E.5,  E.8).  (The  above 
results  are  obtained  from  the  Yule-Walker  equations.) 


Initial  Estimates  for  Moving  Averages.  The 
general  equation  for  moving  average  is : 


-VMk+l+Mk+2+-  • -+Vk^q 

aO  /\  9 a o • 

(l+4>^+<t>2+.  ' "+<^q^ 


The  reader  is  referred  to  Table  D,  page  517,  in  the  Box 
and  Jenkins  text  to  obtain  preliminary  estimates  for  a 
moving  average  process  of  q=l.  will  be  related  to 

0^  and  by  substituting  r^ (w)  for  p^,  the  Table  can  be  used 
to  provide  initial  estimates  for  an  (0,d,l)  process. 
Similarly,  Chart  C,  page  519,  relates  p^,  and  p2,  to 
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9^  and  02*  by  substituting  r^  (w)  and  ^ (w)  for  p^,  and  p^, 
initial  estimates  for  a (0/d, 2)  process  can  be  obtained. 

Initial  Estimates  for  a Mixed  Autoregressive 
Average  Process.  For  order  p=l,d,  q=l,  estimates  can  be 
obtained  by  substituting  r^  (w)  and  ^ (w)  for  p^,  and  p2 
in  the  following  equation: 


d-e  4>  ) (<^-9. ) 

r,  = — — — (E.3) 

■**  /\  O A A 

1+0^  - 2<j>^0^ 


These  equations  can  easily  be  solved,  since  r^ 
and  r2  are  the  first  and  second  autocorrelation  values 
found  in  UNIDEN  (see  Tables  E.2,  E.5,  and  E.8).  Chart  D 
(1:520)  in  the  Box  and  Jenkins  text  relates  p^  and  p ^ 
to  <j>^  and  9^  which  can  be  used  to  provide  initial  esti- 
mates of  the  parameters  for  any  ARIMA  (l,d,l)  process. 
For  example,  the  estimates  for  a linearly  increasing 

A 

series  were  computed  to  be:  r^=37,  r2=.28  <j>^=.76,  and 
0^=.48.  The  program  provided  estimates  of  the  mean  and 

* *2 

variance  of  the  series  to  be  u=14.7  and  c =20.25  (see 


Table  E.2) 
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Other  examples  are: 

Linearly  Decreasing  Poisson  Pattern  (see  Table  E.5) 

A A A A A 

r1=. 49 , r =.49,  ♦ =1,  0^1,  p=30.05,  and  o =51.84. 

Alternating  Linear  Poisson  Pattern  (see  Table  E.8) 

A A A A A 

r1=.53,  r2=.42,  <^=.79,  01=.3O,  y=.28.20,  and  o =43.30. 

These  estimates  and  parameters  for  the  models  were  built 
into  a forecasting  file  using  the  file  building  computer 
program  in  Appendix  C. 

The  Use  of  AFIT.LIB/UNEST,R 
for  Parameter  Estimation  and 
Forecasting 

The  dimension  limitations  of  AFIT.LIB/UNEST,R, 
hereafter  referred  to  as  UNEST,  are: 

25  Parameters 

150  Autocorrelations  of  residuals 

150  Partial  autocorrelations  of  residuals 

300  Forecasts 

5 Time  origins  for  forecasts 
25  New  observations  to  use  in  calculating  update 
forecasts 

500  Observations  in  series 

800  Total  observations  and  forecasts 

200  Maximum  backorders  on  either  side  of  model 

The  variables  in  the  program  which  are  user  supplied 
include  those  used  in  UNIDEN.  The  following  list  includes 
only  new  variables  or  those  with  a different  use  in  UNEST: 
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EPSI 


ESP2 


ICI 


ILDEST 


ILDFCA 


INC 


IOPA 


IPDEST 


The  maximum  change  in  the  relative  sum  of 
squares  before  iteration  in  calculation  of 
parameter  stops.  Set  = .00  to  suppress 
function. 

The  maximum  relative  change  in  each  param- 
eter before  iteration  stops.  Again  it  can 
be  suppressed  by  setting  it  equal  to  .00. 
The  control  on  the  wiuch  of  the  confidence 
limits  for  the  forecasts.  The  values  of 
1,  2,  3,  4,  5 relate  to  50,  75,  90,  95, 

99  percent  limits  respectively. 

The  listing  function  of  the  program  for  the 
estimation  routines.  It  can  be  suppressed 
by  entering  0 for  ILDEST. 

The  listing  of  data  by  forecasting  routine. 
It  can  be  suppressed  by  setting  equal  to  0. 
The  array  containing  MFAC (1) +MFAC (3) +2 
numbers  of  each  of  the  specified  types  of 
parameters  in  the  model  to  be  used. 

An  array  containing  the  order  of  each 
parameter  from  left  to  right  in  the  time 
series  model  being  used  (these  are  the 
powers  of  the  B operators) . 

The  plotting  of  data  estimation  routine. 

It  can  be  surpressed  by  setting  equal  to 


zero. 
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IPDFCA  The  plotting  of  the  data  from  the  forecast- 
ing routine.  Set  = 0 to  suppress. 

IPRES  The  plotting  of  residuals  routine.  It 
can  be  suppressed  by  setting  equal  to 
zero. 

IWTPA  The  plotting  of  the  residual  autocorre- 
lations. It  can  be  suppressed  by  setting 
equal  to  zero. 

MCSE  The  function  which  calculates  the  standard 
errors  of  the  residual  autocorrelation.  It 
can  be  suppressed  by  setting  equal  to  zero. 

MFAC  An  array  of  size  3 where  the  orders  of  the 
ARIMA  (p , d , q)  model  are  enterd.  MFAC(l) 
is  the  number  of  autoregressive  factors; 
MFAC (2)  is  the  number  of  differencing  fac- 
tors; MFAC (3)  is  the  number  of  moving  aver- 
age factors  in  the  time  series  model. 

MIT  The  maximum  number  of  iterations  in  the 

estimation  routine.  The  maximum  that  will 
be  accepted  by  the  program  is  999. 

NF  The  number  of  forecasts. 

NT  The  array  containing  the  forecast  time 
origins . 

The  number  of  time  origins  to  use  in  fore- 
casting. The  forecasts  (NF)  are  made  from 


NTO 


\9 


each  of  the  time  origins.  Warning:  some 

graph  titles  may  not  turn  out  if  NTO 
exceeds  9. 

NU  The  number  of  new  observations  to  be  used 
in  updating  the  forecasts. 

PA  The  array  where  the  initial  estimates 
of  the  parameters  are  stored  for  the 
estimation  routines. 


The  use  of  CAFILE  will  greatly  simplify  the  file 
building  process  for  use  in  UNEST.  Most  of  the  above 


information  is  repeated  in  CAFILE  if  the  user  requests 
assistance  in  building  forecasting  files. 

After  the  model  has  been  identified  and  input 
into  UNEST  with  the  required  parameters , UNEST  iteratively 
determines  the  efficient  parameter  estimates  (see  Tables 
E . 12  and  E.14).  Once  the  output  has  been  obtained,  the  model 
can  be  diagnostically  checked  against  the  data  used  to 
construct  the  model  to  determine  if  it  is  appropriate.  Two 
possible  diagnostic  checks  built  into  the  program  are: 

(1)  overfitting  and  (2)  analysis  of  the  residuals. 

Overfitting  involves  adding  an  additional  param- 
eter to  an  estimated  model  and  testing  the  hypothesis  that 
the  additional  parameter  equals  zero.  Such  a test  could 

A A 

be  made  using  the  fact  that  the  ratio  (B-B)/SE(B),  where 

/\  /\ 

SE (B)  is  the  standard  error  of  B,  is  approximately  normally 


jLJL) 
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distributed,  and  B is  some  parameter  estimate  being  tested. 

A 

The  hull  hypothesis  that  B=0  can  be  tested  by  comparing 
the  above  ratio  with  the  critical  value  of  the  t distribu- 
tion at  the  appropriate  level  of  confidence.  In  the 
UNEST  program,  the  confidence  levels  are  established  by 
the  user  in  the  parameter  ICI.  To  test  for  overfitting, 
a simple  check  can  be  accomplished  by  using  the  summary  of 
the  model  found  in  the  output  of  UNEST  (see  Tables  E.12 
to  E.14).  The  confidence  intervals  about  the  estimated 
value  of  the  parameter  cannot  contain  zero.  If  they  do, 
the  model  is  overfitted  and  the  redundant  parameters  should 
be  removed. 

A 

Analysis  of  the  residuals  a^  is  a result  of  the 
idea  that  if  the  observed  series  is  stationary,  there 
will  be  no  autocorrelation  among  the  residuals.  For 
example,  a large  value  r^  in  the  residual  autocorrela- 
tion function  may  indicate  that  an  addition  of  another 
moving  average  parameter  might  be  justified  in  the  model 
(see  Tables  E.15  to  E.20  and  Figures  E.26  to  E.34). 

To  test  whether  a series  of  autocorrelations 
is  derived  from  a set  of  random  observations  (white  noise) , 
the  Q statistic  may  be  used  where: 
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Fig.  E . 32 . PACF  of  Residuals  for  Linearly  Increasing  Poisson  Process 
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The  computed  value  of  Q may  be  compared  with  the  Chi- 
square  value  having  k-p-q  degrees  of  freedom  to  deter- 
mine if  a set  of  sample  autocorrelations  is  significantly 
large.  Using  UNEST,  the  Chi-square  value  calculated  on 
the  autocorrelation  function  of  the  residuals  is  compared 
with  the  Chi-square  critical  value  obtained  from  Chi- 
square  tables  at  an  appropriate  confidence  level  (refer 
to  Tables  E.18  to  E.20  and  Figures  E.29  to  E.31).  If 
the  computed  Chi-square  value  is  less  than  the  critical 
Chi-square  value,  it  can  be  concluded  that  the  autocorrela- 
tion among  the  residuals  is  not  significant  and  there 
is  no  indication  of  a lack  of  fit. 

Should  either  of  the  the  two  diagnostic  checks 
fail,  the  model  identification  process  should  be  repeated. 
The  Box  and  Jenkins  time  series  analysis  for  forecasting 
methods  are  not  a hands-off  technique  for  forecasting 
future  observations.  The  user  of  time  series  analysis 
is  required  to  follow  the  above  procedures,  iteratively, 
until  the  correct  model  is  built  to  forecast  future  obser- 
vations . 

The  time  series  models  are  useful  in  the  model 
building  process,  i.e.,  building  a model  to  fit  the  data. 
Once  the  model  fits,  forecasts  should  be  excellent,  exhibit- 
ing no  bias  (see  Tables  E.15  to  E.20  and  Figures  E.26  to 
E.28).  Time  series  analysis  techniques  also  offer  the 
advantage  of  being  able  to  calculate  confidence  intervals 
about  the  forecasted  observations. 
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APPENDIX  F 


RESULTS  OF  STATISTICAL  HYPOTHESIS  TEST 

Complete  Outputs  of  Single  Moving 
Average  Forecasts 

This  appendix  contains  the  tables  of  outputs  of  the 
single  moving  average  forecasts  and  the  time  series 
analysis  forecast.  For  each  data  pattern,  the  following 
information  is  given: 

1.  Thirty-six  forecasts  of  each  demand  pattern 

2.  The  sum  of  the  forecast  errors  associated 
with  each  lead  time 

3.  The  average  forecast  error  for  each  lead  time 
(the  forecast  bias) 

4.  The  standard  deviation  of  each  run  of  fore- 
cast errors 

5.  The  statistic: 

Average  Forecast  Error 
Standard  Deviation/36 
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TABLE  F . 2 

STATISTICAL  RESULTS  OF  POISSON  SERIES 
AT  DIFFERENT  LEAD  TIMES 


Lead 

Time 

Error  Sum 

Bias 

Standard 

Deviation 

Test 

Statistic 

1 

-0.0417 

-0.0012 

3.3402 

-0.0021 

3 

-2.0417 

-0.0567 

3.2499 

-0.1047 

6 

1.9583 

0.0544 

3.1362 

0.1041 

9 

-4.0417 

-0.1123 

3.1430 

-0.2143 

12 

-5.0417 

-0.1400 

3.2071 

-0.2620 

15 

-2.0417 

-0.0567 

3.2108 

-0.1060 

18 

-3.0417 

-0.9845 

2.9453 

-0.1721 

21 

-5.0417 

-0.1400 

3.0597 

-0.2746 

24 

-7.0417 

-0.1956 

3.0246 

-0.3874 

27 

3.9583 

0.1100 

2.6237 

0.2514 

30 

-2.0417 

-0.0567 

2.5081 

-0.1357 

33 

-9.0417 

-0.2512 

2.5214 

-0.5977 

36 

-5.0417 

-0.1400 

2.3804 

-0.3530 

TABLE  F . 4 


STATISTICAL  RESULTS  OF  LINEARLY  INCREASING 


SERIES 

AT  DIFFERENT 

LEAD  TIMES 

Lead 

Time 

Error  Sum 

Bias 

Standard 

Deviation 

Test 

Statistic 

1 

-40.333 

-1.204 

3.8808 

-1.7322 

3 

-46.333 

-1.7870 

3.8376 

-2.0177 

6 

-59.333 

-1.6481 

3.7398 

-2.6447 

9 

-63.333 

-1.7593 

3.8793 

-2.7210 

12 

-76.333 

-2.1204 

3.8636 

-3.2929 

15 

-98.333 

-2.715 

3.5932 

-4.5611 

18 

-98.333 

-2.7315 

3.5492 

-4.6177 

21 

-112.333 

-3.1204 

3.1398 

-5.9629 

24 

-124.333 

-3.4537 

3.0693 

-6.7514 

27 

-121.333 

-3.3704 

3.3839 

-5.9760 

30 

-155.33 

-3.2037 

3.3362 

-5.76117 

33 

-112.333 

-3.1204 

3.2574 

-5.7476 

36 

-127.333 

-3.5370 

3.2947 

-6.4414 
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TABLE  F . 6 

STATISTICAL  RESULTS  OF  LINEARLY  DECREASING 
SERIES  AT  DIFFERENT  LEAD  TIMES 


Lead 

Time 

Error  Sum 

Bias 

Standard 

Deviation 

Test 

Statistic 

1 

86.6250 

2.4063 

5.7637 

2.5049 

3 

106.6250 

2.9618 

5.5532 

3.2001 

6 

121.6250 

3.3785 

5.6996 

3.5565 

9 

148.6250 

4.1285 

5.5918 

4.4299 

12 

144.6250 

4.0174 

5.0051 

4.8159 

15 

175.6250 

4.8785 

5.1203 

5.7166 

18 

191.6250 

5.3229 

4.9096 

6.5051 

21 

191.6250 

5.3229 

5.3172 

6.0064 

24 

173.6250 

4.8229 

4.9029 

5.9021 

27 

181.6250 

5.0451 

5.0377 

6.0089 

30 

216.6250 

6.0174 

4.1752 

6.9764 

33 

232.6250 

6.4618 

5.0116 

4.8120 

36 

242.6250 

6.7396 

4.8120 

8.4034 
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TABLE  F . 8 

STATISTICAL  RESULTS  OF  ALTERNATING  LINEAR 
SERIES  AT  DIFFERENT  LEAD  TIMES 


Lead 

Time 


Error  Sum 


Bias 


Standard 

Deviation 


Test 

Statistic 


1 

-49.7083 

-1.3808 

4.0107 

-2.0657 

3 

-54.7083 

-15.197 

4.0929 

-2.2278 

6 

-52.7083 

-1.4641 

4.4057 

-1.9939 

9 

-52.7083 

-1.4641 

4.0864 

-2.1497 

12 

-60.7083 

-1.6863 

3.9587 

-2.5559 

15 

-85.7083 

-2.3808 

4.0947 

-3.4886 

18 

-91.7083 

-2.5475 

4.3702 

-3.4967 

21 

-99.7083 

-2.7697 

4.3407 

-3.8284 

24 

-98.7083 

-2.7419 

3.9847 

-4.1286 

27 

-124.7083 

-3.4641 

3.8577 

-5.3878 

30 

-141.7083 

-3.9363 

4.2133 

-5.6056 

33 

-147.7083 

-4.1030 

4.2193 

-5.8347 

36 

-150.7083 

-4.1863 

3.9492 

-6.3602 

fjpiiH.m  u||  i 


190 


j 

V 


TABLE  F.10 

STATISTICAL  RESULTS  OF  THE  SINE  SERIES 
AT  DIFFERENT  LEAD  TIMES 


Lead 

Time 

Error  Sum 

Bias 

Standard 

Deviation 

Test 

Statistic 

1 

106.7761 

2.9660 

14.8289 

1.2001 

3 

141.8972 

3.9416 

14.7415 

1.6043 

6 

172.2818 

4.7856 

14.4335 

1.9894 

9 

190.4720 

5.2909 

14.7721 

2.1490 

12 

192.0134 

5.3337 

14.7074 

2.1759 

15 

152.2264 

4.2285 

15.0838 

1.6820 

18 

199.4587 

5.5405 

14.6222 

2.2735 

21 

196.7626 

5.4656 

14.5475 

2.2543 

24 

171.3871 

4.7608 

14.5946 

1.9572 

27 

154.5975 

4.2944 

15.0652 

1.7103 

30 

127.1646 

3.5323 

14.3265 

1.4794 

33 

174.5166 

4.8477 

14.7789 

1.9681 

36 

179.4900 

4 9858 

14.9835 

1.9965 
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TABLE  F.ll 

OUTPUT  OF  LINEARLY  INCREASING  POISSON  PATTERN  TIME 
SERIES  FORECASTING  RESULTS 


Lead 

Time 

Error  Sum 

Bias 

Standard 

Deviation 

Test 

Statistic 

1 

12.2740 

.3409 

3.8075 

.5373 

3 

11.0137 

.3059 

3.8077 

.4821 

6 

5.1226 

.1423 

3.7529 

.2275 

9 

8.2314 

.2287 

3.776 

. 3632 

12 

2.3397 

.0650 

3.8187 

.1021 

15 

-12.5524 

-.3487 

3.6367 

-.5753 

18 

- 5.448 

-.1512 

3.5169 

-.2580 

21 

-12.3377 

-.3427 

3.1119 

-.6608 

24 

-17.2310 

-.4786 

3.0962 

-.9275 

27 

- 7.1248 

-.1979 

3.2977 

-.3601 

30 

5.9811 

.1661 

3.2108 

.3105 

33 

16.0864 

.4468 

3.1837 

.8421 

36 

8.1914 

.2275 

3.2669 

.4179 
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TABLE  F.12 

OUTPUT  OF  LINEARLY  DECREASING  POISSON  TIME 


SERIES 

FORECASTING 

RESULTS 

Lead 

Time 

Error  Sum 

Bias 

Standard 

Deviation 

Test 

Statistic 

1 

-12.5007 

-.5139 

5.6922 

-.5417 

3 

- 8.2154 

-.2282 

5.4838 

-.2497 

6 

- 7.5143 

-.2087 

5.5058 

-.2275 

9 

5.5079 

.1530 

5.5039 

.1668 

12 

-12.1560 

-.3317 

5.1270 

-.03952 

15 

5.4870 

.1524 

4.9738 

.1839 

18 

8.4301 

.2342 

4.6246 

.3038 

21 

- 4.2701 

-.1187 

5.0385 

-.1412 

24 

-34.7472 

-.9652 

4.6795 

-1.2376 

27 

-38.9439 

-1.0818 

4.7707 

-1.3605 

30 

-15.8667 

-.4407 

4.7822 

-.5530 

33 

-11.5217 

-.3200 

4.5831 

-.4190 

36 

-12.9152 

-.3588 

4.6439 

-.4635 
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TABLE  F. 13 

OUTPUT  OF  ALTERNATING  LINEAR  INCREASING  AND  DECREASING 
POISSON  TIME  SERIES  FORECASTING  RESULTS 


Lead 

Time 


Error  Sum  Bias 


Standard  Test 

Deviation  Statistic 


1 

18.2847 

.5079 

3.9042 

.7806 

3 

21.4741 

.5965 

3.8976 

.9183 

6 

35.7376 

.9933 

4.2802 

1.3924 

9 

48.0407 

1.3349 

4.1542 

1.9274 

12 

52.3237 

1.4534 

4.1720 

2.0903 

15 

39.6061 

1.1002 

4.0730 

1.7207 

18 

45.8831 

1.2747 

4.1671 

1.83333 

21 

50.1646 

1.3935 

4.1798 

2.0003 

24 

63.4456 

1.7624 

3.9926 

2.6485 

27 

49.7261 

1.3813 

3.9523 

2.0969 

30 

45.0060 

1.2502 

4.1951 

1.7850 

33 

51.2856 

1.4246 

4.1212 

2.0740 

36 

60.5650 

1.6824 

3.8631 

2.5995 
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